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1. CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part application of U.S. Application Serial 
No. 09/560,875, filed April 27, 2000, which in turn is a continuation-in-part 
application of U.S. Application Serial No. 09/496,914, filed February 03, 2000, both 
of which are incorporated herein by reference in their entirety. 

2. BACKGROUND OF THE INVENTION 

2.1 TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by 
such polynucleotides, along with uses for these polynucleotides and proteins, for 
example in therapeutic, diagnostic and research methods. 

2.2 BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, 
such as lymphokines, interferons, CSFs, chemokines, and interleukins) has matured 
rapidly over the past decade. The now routine hybridization cloning and expression 
cloning techniques clone novel polynucleotides "directly" in the sense that they rely on 
information directly related to the discovered protein (i.e., partial DNA/amino acid 
sequence of the protein in the case of hybridization cloning; activity of the protein in 
the case of expression cloning). More recent "indirect" cloning techniques such as 
signal sequence cloning, which isolates DNA sequences based on the presence of a now 
well-recognized secretory leader sequence motif, as well as various PCR-based or low 
stringency hybridization-based cloning techniques, have advanced the state of the art by 
making available large numbers of DNA/amino acid sequences for proteins that are 
known to have biological activity, for example, by virtue of their secreted nature in the 
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case of leader sequence cloning, by virtue of their cell or tissue source in the case of 
PCR-based techniques, or by virtue of structural similarity to other genes of known 
biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications 
in, for example, diagnostics, forensics, gene mapping; identification of mutations 
responsible for genetic disorders or other traits, to assess biodiversity, and to produce 
many other types of data and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, 
novel isolated polynucleotides encoding such polypeptides, including recombinant DNA 
molecules, cloned genes or degenerate variants thereof, especially naturally occurring 
variants such as allelic variants, antisense polynucleotide molecules, and antibodies that 
specifically recognize one or more epitopes present on such polypeptides, as well as 
hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including 
expression vectors, containing the polynucleotides of the invention, cells genetically 
engineered to contain such polynucleotides and cells genetically engineered to express 
such polynucleotides. 

The present invention relates to a collection or library of at least one novel nucleic 
acid sequence assembled from expressed sequence tags (ESTs) isolated mainly by 
sequencing by hybridization (SBH), and in some cases, sequences obtained from one or 
more public databases. The invention relates also to the proteins encoded by such 
polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. These nucleic acid sequences are designated as SEQ ID 
NO: 1-419 and are provided in the Sequence Listing. In the nucleic acids provided in 
the Sequence Listing, A is adenosine; C is cytosine; G is guanosine; T is thymine; and N 
is any of the four bases. In the amino acids provided in the Sequence Listing, * 
corresponds to the stop codon. 
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The nucleic acid sequences of the present invention also include, nucleic acid 
sequences that hybridize to the complement of SEQ ID NO: 1 - 419 under stringent 
hybridization conditions; nucleic acid sequences which are allelic variants or species 
homologues of any of the nucleic acid sequences recited above, or nucleic acid sequences 
5 that encode a peptide comprising a specific domain or truncation of the peptides encoded 
by SEQ ID NO: 1-419. A polynucleotide comprising a nucleotide sequence having at 
least 90% identity to an identifying sequence of SEQ ID NO: 1 - 419 or a degenerate 
variant or fragment thereof. The identifying sequence can be 100 base pairs in length. 
The nucleic acid sequences of the present invention also include the sequence 

10 information from the nucleic acid sequences of SEQ ID NO: 1-419. The sequence 

information can be a segment of any one of SEQ ID NO: 1-419 that uniquely identifies 
or represents the sequence information of SEQ ID NO: 1 - 419. 

A collection as used in this application can be a collection of only one 
polynucleotide. The collection of sequence information or identifying information of each 

15 sequence can be provided on a nucleic acid array. In one embodiment, segments of 

sequence information is provided on a nucleic acid array to detect the polynucleotide that 
contains the segment. The array can be designed to detect full-match or mismatch to the 
polynucleotide that contains the segment. The collection can also be provided in a 
computer-readable format. 

20 This invention also includes the reverse or direct complement of any of the nucleic 

acid sequences recited above; cloning or expression vectors containing the nucleic acid 
sequences; and host cells or organisms transformed with these expression vectors. 
Nucleic acid sequences (or their reverse or direct complements) according to the invention 
have numerous applications in a variety of techniques known to those skilled in the art of 

25 molecular biology, such as use as hybridization probes, use as primers for PCR, use in an 
array, use in computer-readable media, use in sequencing full-length genes, use for 
chromosome and gene mapping, use in the recombinant production of protein, and use in 
the generation of anti-sense DNA or RNA, their chemical analogs and the like. 

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-419 or 

30 novel segments or parts of the nucleic acids of the invention are used as primers in 
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expression assays that are well known in the art. In a particularly preferred embodiment, 
the nucleic acid sequences of SEQ ID NO: 1-419 or novel segments or parts of the nucleic 
acids provided herein are used in diagnostics for identifying expressed genes or, as well 
known in the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as 
5 expressed sequence tags for physical mapping of the human genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in the SEQ ID 
NO: 1-419; a polynucleotide comprising any of the full length protein coding sequences 
of the SEQ ID NO: 1-419; and a polynucleotide comprising any of the nucleotide 

10 sequences of the mature protein coding sequences of the SEQ ID NO: 1-419. The 
polynucleotides of the present invention also include, but are not limited to, a 
polynucleotide that hybridizes under stringent hybridization conditions to (a) the 
complement of any one of the nucleotide sequences set forth in the SEQ ID NO: 1-419; 
(b) a nucleotide sequence encoding any one of the amino acid sequences set forth in the 

15 Sequence Listing; (c) a polynucleotide which is an allelic variant of any polynucleotides 
recited above; (d) a polynucleotide which encodes a species homolog (e.g. orthologs) of 
any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of any of the polypeptides comprising an 
amino acid sequence set forth in the Sequence Listing. 

20 The isolated polypeptides of the invention include, but are not limited to, a 

polypeptide comprising any of the amino acid sequences set forth in the Sequence Listing; 
or the corresponding full length or mature protein. Polypeptides of the invention also 
include polypeptides with biological activity that are encoded by (a) any of the 
polynucleotides having a nucleotide sequence set forth in the SEQ ID NO: 1-419; or (b) 

25 polynucleotides that hybridize to the complement of the polynucleotides of (a) under 
stringent hybridization conditions. Biologically or immunologically active variants of any 
of the polypeptide sequences in the Sequence Listing, and "substantial equivalents" thereof 
(e.g., with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino 
acid sequence identity) that preferably retain biological activity are also contemplated. The 

30 polypeptides of the invention may be wholly or partially chemically synthesized but are 



787CIP2B 



4 



preferably produced by recombinant means using the genetically engineered cells (e.g. 
host cells) of the invention. 

The invention also provides compositions comprising a polypeptide of the 
invention. Polypeptide compositions of the invention may further comprise an 
acceptable carrier, such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 

The invention also provides host cells transformed or transfected with a 
polynucleotide of the invention. 

The invention also relates to methods for producing a polypeptide of the 
invention comprising growing a culture of the host cells of the invention in a suitable 
culture medium under conditions permitting expression of the desired polypeptide, and 
purifying the polypeptide from the culture or from the host cells. Preferred 
embodiments include those in which the protein produced by such process is a mature 
form of the protein. 

Polynucleotides according to the invention have numerous applications in a 
variety of techniques known to those skilled in the art of molecular biology. These 
techniques include use as hybridization probes, use as oligomers, or primers, for PCR, 
use for chromosome and gene mapping, use in the recombinant production of protein, 
and use in generation of anti-sense DNA or RNA, their chemical analogs and the like. 
For example, when the expression of an mRNA is largely restricted to a particular cell 
or tissue type, polynucleotides of the invention can be used as hybridization probes to 
detect the presence of the particular cell or tissue mRNA in a sample using, e.g., in situ 
hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known in the art 
and exemplified by Vollrath et al. } Science 258:52-59 (1992), as expressed sequence 
tags for physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of 
conventional procedures and methods that are currently applied to other proteins. For 
example, a polypeptide of the invention can be used to generate an antibody that 
specifically binds the polypeptide. Such antibodies, particularly monoclonal antibodies, 
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are useful for detecting or quantitating the polypeptide in tissue. The polypeptides of 
the invention can also be used as molecular weight markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical 
condition which comprises the step of administering to a mammalian subject a 
therapeutically effective amount of a composition comprising a polypeptide of the 
present invention and a pharmaceutically acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be 
utilized, for example, in methods for the prevention and/or treatment of disorders 
involving aberrant protein expression or biological activity. 

The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for 
example, be utilized as part of prognostic and diagnostic evaluation of disorders as 
recited herein and for the identification of subjects exhibiting a predisposition to such 
conditions. The invention provides a method for detecting the polynucleotides of the 
invention in a sample, comprising contacting the sample with a compound that binds to 
and forms a complex with the polynucleotide of interest for a period sufficient to form 
the complex and under conditions sufficient to form a complex and detecting the 
complex such that if a complex is detected, the polynucleotide of interest is detected. 
The invention also provides a method for detecting the polypeptides of the invention in 
a sample comprising contacting the sample with a compound that binds to and forms a 
complex with the polypeptide under conditions and for a period sufficient to form the 
complex and detecting the formation of the complex such that if a complex is formed, 
the polypeptide is detected. 

The invention also provides kits comprising polynucleotide probes and/or 
monoclonal antibodies, and optionally quantitative standards, for carrying out methods 
of the invention. Furthermore, the invention provides methods for evaluating the 
efficacy of drugs, and monitoring the progress of patients, involved in clinical trials for 
the treatment of disorders as recited above. 

The invention also provides methods for the identification of compounds that 
modulate (i.e., increase or decrease) the expression or activity of the polynucleotides 
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and/or polypeptides of the invention. Such methods can be utilized, for example, for 
the identification of compounds that can ameliorate symptoms of disorders as recited 
herein. Such methods can include, but are not limited to, assays for identifying 
compounds and other substances that interact with {e.g. , bind to) the polypeptides of the 
5 invention. The invention provides a method for identifying a compound that binds to 
the polypeptides of the invention comprising contacting the compound with a 
polypeptide of the invention in a cell for a time sufficient to form a 
polypeptide/compound complex, wherein the complex drives expression of a reporter 
gene sequence in the cell; and detecting the complex by detecting the reporter gene 

10 sequence expression such that if expression of the reporter gene is detected the 
compound the binds to a polypeptide of the invention is identified. 

The methods of the invention also provides methods for treatment which involve 
the administration of the polynucleotides or polypeptides of the invention to individuals 
exhibiting symptoms or tendencies. In addition, the invention encompasses methods for 

15 treating diseases or disorders as recited herein comprising administering compounds 
and other substances that modulate the overall activity of the target gene products. 
Compounds and other substances can effect such modulation either on the level of 
target gene/protein expression or target protein activity. 

The polypeptides of the present invention and the polynucleotides encoding them 

20 are also useful for the same functions known to one of skill in the art as the 

polypeptides and polynucleotides to which they have the closest homology (set forth in 
Table 1). If no homology is set forth for a sequence, then the polypeptides and 
polynucleotides of the present invention are useful for a variety of applications, as 
described herein, including use in arrays for detection. 

25 

4. DETAILED DESCRIPTION OF THE INVENTION 
4.1 DEFINITIONS 
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It must be noted that as used herein and in the appended claims, the singular 
forms "a", "an" and "the" include plural references unless the context clearly dictates 
otherwise. 

The term "active" refers to those forms of the polypeptide which retain the 
5 biologic and/or immunologic activities of any naturally occurring polypeptide. 
According to the invention, the terms "biologically active" or "biological activity" 
refer to a protein or peptide having structural, regulatory or biochemical functions of a 
naturally occurring molecule. Likewise "immunologically active" or "immunological 
activity" refers to the capability of the natural, recombinant or synthetic polypeptide to 
10 induce a specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The term "activated cells" as used in this application are those cells which are 
engaged in extracellular or intracellular membrane trafficking, including the export of 
secretory or enzymatic molecules as part of a normal or disease process. 

1 15 The terms "complementary" or "complementarity" refer to the natural binding 

of polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5\ Complementarity between two single-stranded 
molecules may be "partial" such that only some of the nucleic acids bind or it may be 
"complete" such that total complementarity exists between the single stranded 

20 molecules. The degree of complementarity between the nucleic acid strands has 
significant effects on the efficiency and strength of the hybridization between the 
nucleic acid strands. 

The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 
differentiated cell types in an embryo or an adult, including the germ cells. The term 

25 "germ line stem cells (GSCs)" refers to stem cells derived from primordial stem cells 
that provide a steady and continuous source of germ cells for the production of 
gametes. The term "primordial germ cells (PGCs)" refers to a small population of 
cells set aside from other cell lineages particularly from the yolk sac, mesenteries, or 
gonadal ridges during embryogenesis that have the potential to differentiate into germ 

30 cells and other cells. PGCs are the source from which GSCs and ES cells are derived 
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The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells not 
only populate the germ line and give rise to a plurality of terminally differentiated cells 
that comprise the adult specialized organs, but are able to regenerate themselves. 
The term "expression modulating fragment," EMF, means a series of 
5 nucleotides which modulates the expression of an operably linked ORF or another 
EMF. 

As used herein, a sequence is said to "modulate the expression of an operably 
linked sequence" when the expression of the sequence is altered by the presence of the 
EMF. EMFs include, but are not limited to, promoters, and promoter modulating 

10 sequences (inducible elements). One class of EMFs are nucleic acid fragments which 
induce the expression of an operably linked ORF in response to a specific regulatory 
factor or physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
"oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides 

15 or the sequence of these nucleotides. These phrases also refer to DNA or RNA of 

genomic or synthetic origin which may be single-stranded or double-stranded and may 
represent the sense or the antisense strand, to peptide nucleic acid (PNA) or to any 
DNA-like or RNA-like material. Generally, nucleic acid segments provided by this 
invention may be assembled from fragments of the genome and short oligonucleotide 

20 linkers, or from a series of oligonucleotides, or from individual nucleotides, to provide 
a synthetic nucleic acid which is capable of being expressed in a recombinant 
transcriptional unit comprising regulatory elements derived from a microbial or viral 
operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", 

25 "portion," or "segment" or "probe" or "primer" are used interchangeable and refer to a 
sequence of nucleotide residues which are at least about 5 nucleotides, more preferably 
at least about 7 nucleotides, more preferably at least about 9 nucleotides, more 
preferably at least about 11 nucleotides and most preferably at least about 17 
nucleotides. The fragment is preferably less than about 500 nucleotides, preferably less 

30 than about 200 nucleotides, more preferably less than about 100 nucleotides, more 
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preferably less than about 50 nucleotides and most preferably less than 30 nucleotides. 
Preferably the probe is from about 6 nucleotides to about 200 nucleotides, preferably 
from about 15 to about 50 nucleotides, more preferably from about 17 to 30 nucleotides 
and most preferably from about 20 to 25 nucleotides. Preferably the fragments can be 
5 used in polymerase chain reaction (PCR), various hybridization procedures or 

microarray procedures to identify or amplify identical or related parts of mRNA or 
DNA molecules. A fragment or segment may uniquely identify each polynucleotide 
sequence of the present invention. Preferably the fragment comprises a sequence 
substantially similar to any one of SEQ ID NOs: 1-419. 
10 Probes may, for example, be used to determine whether specific mRNA 

molecules are present in a cell or tissue or to isolate similar nucleic acid sequences 
from chromosomal DNA as described by Walsh et al. (Walsh, P.S. et al., 1992, PCR 
Methods Appl 1:241-250). They may be labeled by nick translation, Klenow fill-in 
reaction, PCR, or other methods well known in the art. Probes of the present 
; 15 invention, their preparation and/or labeling are elaborated in Sambrook, J. et al., 1989, 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY; or 
Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John Wiley & 
Sons, New York NY, both of which are incorporated herein by reference in their 
entirety. 

*20 The nucleic acid sequences of the present invention also include the sequence 

information from the nucleic acid sequences of SEQ ID NOs: 1-419. The sequence 
information can be a segment of any one of SEQ ID NOs: 1-419 that uniquely identifies 
or represents the sequence information of that sequence of SEQ ID NO: 1-419. One 
such segment can be a twenty-mer nucleic acid sequence because the probability that a 

25 twenty-mer is fully matched in the human genome is 1 in 300. In the human genome, 
there are three billion base pairs in one set of chromosomes. Because 4 20 possible 
twenty-mers exist, there are 300 times more twenty-mers than there are base pairs in a 
set of human chromosome. Using the same analysis, the probability for a seventeen- 
mer to be fully matched in the human genome is approximately 1 in 5. When these 

30 segments are used in arrays for expression studies, fifteen-mer segments can be used. 
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The probability that the fifteen-mer is fully matched in the expressed sequences is also 
approximately one in five because expressed sequences comprise less than 
approximately 5% of the entire genome sequence. 

Similarly, when using sequence information for detecting a single mismatch, a 
5 segment can be a twenty-five mer. The probability that the twenty-five mer would appear 
in a human genome with a single mismatch is calculated by multiplying the probability for 
a full match (l-^4 25 ) times the increased probability for mismatch at each nucleotide 
position (3 x 25). The probability that an eighteen mer with a single mismatch can be 
detected in an array for expression studies is approximately one in five. The probability 
10 that a twenty-mer with a single mismatch can be detected in a human genome is 
approximately one in five. 

The term "open reading frame," ORF, means a series of nucleotide triplets 
coding for amino acids without any termination codons and is a sequence translatable 
into protein. 

15 The terms "operably linked" or "operably associated" refer to functionally 

related nucleic acid sequences. For example, a promoter is operably associated or 
operably linked with a coding sequence if the promoter controls the transcription of the 
coding sequence. While operably linked nucleic acid sequences can be contiguous and 
in the same reading frame, certain genetic elements e.g. repressor genes are not 

20 contiguously linked to the coding sequence but still control transcription/translation of 
the coding sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a 
number of differentiated cell types that are present in an adult organism. A pluripotent 
cell is restricted in its differentiation capability in comparison to a totipotent cell. 

25 The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an 

oligopeptide, peptide, polypeptide or protein sequence or fragment thereof and to 
naturally occurring or synthetic molecules. A polypeptide "fragment," "portion," or 
"segment" is a stretch of amino acid residues of at least about 5 amino acids, preferably 
at least about 7 amino acids, more preferably at least about 9 amino acids and most 

30 preferably at least about 17 or more amino acids. The peptide preferably is not greater 
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than about 200 amino acids, more preferably less than 150 amino acids and most 
preferably less than 100 amino acids. Preferably the peptide is from about 5 to about 
200 amino acids. To be active, any polypeptide must have sufficient length to display 
biological and/or immunological activity. 
5 The term "naturally occurring polypeptide" refers to polypeptides produced by 

cells that have not been genetically engineered and specifically contemplates various 
polypeptides arising from post-translational modifications of the polypeptide including, 
but not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation 
and acylation. 

10 The term "translated protein coding portion" means a sequence which encodes 

for the full length protein which may include any leader sequence or any processing 
sequence. 

The term "mature protein coding sequence" means a sequence which encodes a 
peptide or protein without a signal or leader sequence. The peptide may have been 

15 produced by processing in the cell which removes any leader/signal sequence. The 
peptide may be produced synthetically or the protein may have been produced using a 
polynucleotide only encoding for the mature protein coding sequence. 

The term "derivative" refers to polypeptides chemically modified by such 
techniques as ubiquitination, labeling (e.g., with radionuclides or various enzymes), 

20 covalent polymer attachment such as pegylation (derivatization with polyethylene 
glycol) and insertion or substitution by chemical synthesis of amino acids such as 
ornithine, which do not normally occur in human proteins. 

The term "variant "(or "analog") refers to any polypeptide differing from 
naturally occurring polypeptides by amino acid insertions, deletions, and substitutions, 

25 created using, e g., recombinant DNA techniques. Guidance in determining which 
amino acid residues may be replaced, added or deleted without abolishing activities of 
interest, may be found by comparing the sequence of the particular polypeptide with 
that of homologous peptides and minimizing the number of amino acid sequence 
changes made in regions of high homology (conserved regions) or by replacing amino 

30 acids with consensus sequence. 
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Alternatively, recombinant variants encoding these same or similar polypeptides 
may be synthesized or selected by making use of the "redundancy" in the genetic code. 
Various codon substitutions, such as the silent changes which produce various 
restriction sites, may be introduced to optimize cloning into a plasmid or viral vector or 
5 expression in a particular prokaryotic or eukaryotic system. Mutations in the 
polynucleotide sequence may be reflected in the polypeptide or domains of other 
peptides added to the polypeptide to modify the properties of any part of the 
polypeptide, to change characteristics such as ligand-binding affinities, interchain 
affinities, or degradation/turnover rate. 

10 Preferably, amino acid "substitutions" are the result of replacing one amino acid 

with another amino acid having similar structural and/or chemical properties, i.e. , 
conservative amino acid replacements. "Conservative" amino acid substitutions may be 
made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity, and/or the amphipathic nature of the residues involved. For example, 

15 nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, 

proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; positively 
charged (basic) amino acids include arginine, lysine, and histidine; and negatively 
charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or 

20 "deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 
to 10 amino acids. The variation allowed may be experimentally determined by 
systematically making insertions, deletions, or substitutions of amino acids in a 
polypeptide molecule using recombinant DNA techniques and assaying the resulting 
recombinant variants for activity. 

25 Alternatively, where alteration of function is desired, insertions, deletions or 

non-conservative alterations can be engineered to produce altered polypeptides. Such 
alterations can, for example, alter one or more of the biological functions or 
biochemical characteristics of the polypeptides of the invention. For example, such 
alterations may change polypeptide characteristics such as ligand-binding affinities, 

30 interchain affinities, or degradation/turnover rate. Further, such alterations can be 
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selected so as to generate polypeptides that are better suited for expression, scale up 
and the like in the host cells chosen for expression. For example, cysteine residues can 
be deleted or substituted with another amino acid residue in order to eliminate disulfide 
bridges. 

5 The terms "purified" or "substantially purified" as used herein denotes that the 

indicated nucleic acid or polypeptide is present in the substantial absence of other 
biological macromolecules, e.g. , polynucleotides, proteins, and the like. In one 
embodiment, the polynucleotide or polypeptide is purified such that it constitutes at 
least 95% by weight, more preferably at least 99% by weight, of the indicated 
10 biological macromolecules present (but water, buffers, and other small molecules, 
especially molecules having a molecular weight of less than 1000 daltons, can be 
present). 

J The term "isolated" as used herein refers to a nucleic acid or polypeptide 

I separated from at least one other component (e.g., nucleic acid or polypeptide) present 
1 15 with the nucleic acid or polypeptide in its natural source. In one embodiment, the 
] nucleic acid or polypeptide is found in the presence of (if anything) only a solvent, 

buffer, ion, or other component normally present in a solution of the same. The terms 
"isolated" and "purified" do not encompass nucleic acids or polypeptides present in 
, their natural source. 

; 20 The term "recombinant," when used herein to refer to a polypeptide or protein, 

means that a polypeptide or protein is derived from recombinant (e.g., microbial, 
insect, or mammalian) expression systems. "Microbial" refers to recombinant 
polypeptides or proteins made in bacterial or fungal (e.g., yeast) expression systems. 
As a product, "recombinant microbial" defines a polypeptide or protein essentially free 

25 of native endogenous substances and unaccompanied by associated native glycosylation. 
Polypeptides or proteins expressed in most bacterial cultures, e.g., E. coli, will be free 
of glycosylation modifications; polypeptides or proteins expressed in yeast will have a 
glycosylation pattern in general different from those expressed in mammalian cells. 
The term "recombinant expression vehicle or vector" refers to a plasmid or 

30 phage or virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. 
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An expression vehicle can comprise a transcriptional unit comprising an assembly of (1) 
a genetic element or elements having a regulatory role in gene expression, for example, 
promoters or enhancers, (2) a structural or coding sequence which is transcribed into 
mRNA and translated into protein, and (3) appropriate transcription initiation and 
5 termination sequences. Structural units intended for use in yeast or eukaryotic 

expression systems preferably include a leader sequence enabling extracellular secretion 
of translated protein by a host cell. Alternatively, where recombinant protein is 
expressed without a leader or transport sequence, it may include an amino terminal 
methionine residue. This residue may or may not be subsequently cleaved from the 

10 expressed recombinant protein to provide a final product. 

The term "recombinant expression system" means host cells which have stably 
integrated a recombinant transcriptional unit into chromosomal DNA or carry the 
recombinant transcriptional unit extrachromosomally. Recombinant expression systems 
as defined herein will express heterologous polypeptides or proteins upon induction of 

15 the regulatory elements linked to the DNA segment or synthetic gene to be expressed. 
This term also means host cells which have stably integrated a recombinant genetic 
element or elements having a regulatory role in gene expression, for example, 
promoters or enhancers. Recombinant expression systems as defined herein will 
express polypeptides or proteins endogenous to the cell upon induction of the regulatory 

20 elements linked to the endogenous DNA segment or gene to be expressed. The cells 
can be prokaryotic or eukaryotic. 

The term "secreted" includes a protein that is transported across or through a 
membrane, including transport as a result of signal sequences in its amino acid 
sequence when it is expressed in a suitable host cell. "Secreted" proteins include 

25 without limitation proteins secreted wholly (e.g. , soluble proteins) or partially (e.g. , 
receptors) from the cell in which they are expressed. "Secreted" proteins also include 
without limitation proteins that are transported across the membrane of the endoplasmic 
reticulum. "Secreted" proteins are also intended to include proteins containing 
non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, P.A, and Young, 

30 P.R. (1992) Cytokine 4(2): 134 -143) and factors released from damaged cells (e.g. 
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Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. (1998) Annu. Rev. 
Immunol. 16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or 
leader sequence" which will direct the polypeptide through the membrane of a cell. 
Such a sequence may be naturally present on the polypeptides of the present invention 
or provided from heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly 
understood in the art as stringent. Stringent conditions can include highly stringent 
conditions (i.e., hybridization to filter-bound DNA in 0.5 M NaHP04, 7% sodium 
dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and washing in 0.1X SSC/0.1% SDS at 
68°C), and moderately stringent conditions (i.e., washing in 0.2X SSC/0.1% SDS at 
42°C). Other exemplary hybridization conditions are described herein in the examples. 

In instances of hybridization of deoxy oligonucleotides, additional exemplary 
stringent hybridization conditions include washing in 6X SSC/0.05% sodium 
pyrophosphate at 37°C (for 14-base oligonucleotides), 48°C (for 17-base oligos), 55°C 
(for 20-base oligonucleotides), and 60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" can refer both to nucleotide and 
amino acid sequences, for example a mutant sequence, that varies from a reference 
sequence by one or more substitutions, deletions, or additions, the net effect of which 
does not result in an adverse functional dissimilarity between the reference and subject 
sequences. Typically, such a substantially equivalent sequence varies from one of those 
listed herein by no more than about 35% (i.e., the number of individual residue 
substitutions, additions, and/or deletions in a substantially equivalent sequence, as 
compared to the corresponding reference sequence, divided by the total number of 
residues in the substantially equivalent sequence is about 0.35 or less). Such a 
sequence is said to have 65% sequence identity to the listed sequence. In one 
embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies 
from a listed sequence by no more than 30% (70% sequence identity); in a variation of 
this embodiment, by no more than 25% (75% sequence identity); and in a further 
variation of this embodiment, by no more than 20% (80% sequence identity) and in a 
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further variation of this embodiment, by no more than 10% (90% sequence identity) 
and in a further variation of this embodiment, by no more that 5% (95% sequence 
identity). Substantially equivalent, e.g. , mutant, amino acid sequences according to the 
invention preferably have at least 80% sequence identity with a listed amino acid 
sequence, more preferably at least 90% sequence identity. Substantially equivalent 
nucleotide sequences of the invention can have lower percent sequence identities, taking 
into account, for example, the redundancy or degeneracy of the genetic code. 
Preferably, nucleotide sequence has at least about 65% identity, more preferably at 
least about 75 % identity, and most preferably at least about 95 % identity. For the 
purposes of the present invention, sequences having substantially equivalent biological 
activity and substantially equivalent expression characteristics are considered 
substantially equivalent. For the purposes of determining equivalence, truncation of the 
mature sequence (e.g., via a mutation which creates a spurious stop codon) should be 
disregarded. Sequence identity may be determined, e.g., using the Jotun Hein method 
(Hein, J. (1990) Methods Enzymol. 183:626-645). Identity between sequences can also 
be determined by other methods known in the art, e.g. by varying hybridization 
conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of 
the cell types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so 
that the DNA is replicable, either as an extrachromosomal element, or by chromosomal 
integration. The term "transfection" refers to the taking up of an expression vector by a 
suitable host cell, whether or not any coding sequences are in fact expressed. The term 
"infection" refers to the introduction of nucleic acids into a suitable host cell by use of a 
virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of 
nucleotides which mediate the uptake of a linked DNA fragment into a cell. UMFs can 
be readily identified using known UMFs as a target sequence or target motif with the 
computer-based systems described below. The presence and activity of a UMF can be 
confirmed by attaching the suspected UMF to a marker sequence. The resulting nucleic 
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acid molecule is then incubated with an appropriate host under appropriate conditions 
and the uptake of the marker sequence is determined. As described above, a UMP will 
increase the frequency of uptake of a linked marker sequence. 

Each of the above terms is meant to encompass all that is described for each, 
5 unless the context dictates otherwise. 

4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 
The isolated polynucleotides of the invention include a polynucleotide 

10 comprising the nucleotide sequences of the SEQ ID NO: 1 - 419; a polynucleotide 
encoding any one of the peptide sequences of SEQ ID NO:l - 419; and a 
polynucleotide comprising the nucleotide sequence encoding the mature protein coding 
sequence of the polynucleotides of any one of SEQ ID NO: 1-419. The 
polynucleotides of the present invention also include, but are not limited to, a 

15 polynucleotide that hybridizes under stringent conditions to (a) the complement of any 
of the nucleotides sequences of the SEQ ID NO: 1 - 419; (b) nucleotide sequences 
encoding any one of the amino acid sequences set forth in the Sequence Listing; (c) a 
polynucleotide which is an allelic variant of any polynucleotide recited above; (d) a 
polynucleotide which encodes a species homolog of any of the proteins recited above; 

20 or (e) a polynucleotide that encodes a polypeptide comprising a specific domain or 

truncation of the polypeptides of SEQ ID NO: 1-419. Domains of interest may depend 
on the nature of the encoded polypeptide; e.g., domains in receptor-like polypeptides 
include ligand-binding, extracellular, transmembrane, or cytoplasmic domains, or 
combinations thereof; domains in immunoglobulin-like proteins include the variable 

25 immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic 
and substrate binding domains; and domains in ligand polypeptides include receptor- 
binding domains. 

The polynucleotides of the invention include naturally occurring or wholly or 
partially synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The 
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polynucleotides may include all of the coding region of the cDNA or may represent a 
portion of the coding region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences 
disclosed herein. The corresponding genes can be isolated in accordance with known 
5 methods using the sequence information disclosed herein. Such methods include the 
preparation of probes or primers from the disclosed sequence information for 
identification and/or amplification of genes in appropriate genomic libraries or other 
sources of genomic materials. Further 5' and 3' sequence can be obtained using methods 
known in the art. For example, full length cDNA or genomic DNA that corresponds to 
10 any of the polynucleotides of the SEQ ID NO: 1 - 419 can be obtained by screening 

appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using 
any of the polynucleotides of the SEQ ID NO: 1 - 419 or a portion thereof as a probe. 
Alternatively, the polynucleotides of the SEQ ID NO: 1 - 419 may be used as the basis for 
suitable primer(s) that allow identification and/or amplification of genes in appropriate 
j 15 genomic DNA or cDNA libraries. 

The nucleic acid sequences of the invention can be assembled from ESTs and 
sequences (including cDNA and genomic sequences) obtained from one or more public 
databases, such as dbEST, gbpri, and UniGene. The EST sequences can provide 
identifying sequence information, representative fragment or segment information, or 
20 novel segment information for the full-length gene. 

The polynucleotides of the invention also provide polynucleotides including 
nucleotide sequences that are substantially equivalent to the polynucleotides recited 
above. Polynucleotides according to the invention can have, e.g. , at least about 65%, 
at least about 70%, at least about 75%, at least about 80%, more typically at least about 
25 90%, and even more typically at least about 95%, sequence identity to a polynucleotide 
recited above. 

Included within the scope of the nucleic acid sequences of the invention are 
nucleic acid sequence fragments that hybridize under stringent conditions to any of the 
nucleotide sequences of the SEQ ID NO: 1 - 419, or complements thereof, which 
30 fragment is greater than about 5 nucleotides, preferably 7 nucleotides, more preferably 
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greater than 9 nucleotides and most preferably greater than 17 nucleotides. Fragments 
of, e.g. 15, 17, or 20 nucleotides or more that are selective for (i.e. specifically 
hybridize to any one of the polynucleotides of the invention) are contemplated. Probes 
capable of specifically hybridizing to a polynucleotide can differentiate polynucleotide 
5 sequences of the invention from other polynucleotide sequences in the same family of 
genes or can differentiate human genes from genes of other species, and are preferably 
based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to 
these specific sequences, but also include allelic and species variations thereof. Allelic 

10 and species variations can be routinely determined by comparing the sequence provided in 
SEQ ID NO: 1 - 419, a representative fragment thereof, or a nucleotide sequence at least 
90% identical, preferably 95% identical, to SEQ ID NOs: 1 - 419 with a sequence from 
another isolate of the same species. Furthermore, to accommodate codon variability, the 
invention includes nucleic acid molecules coding for the same amino acid sequences as do 

1 5 the specific ORFs disclosed herein. In other words, in the coding region of an ORF, 

substitution of one codon for another codon that encodes the same amino acid is expressly 
contemplated. 

The nearest neighbor result for the nucleic acids of the present invention, including 
SEQ ID NOs: 1 - 419, can be obtained by searching a database using an algorithm or a 
20 program. Preferably, a BLAST which stands for Basic Local Alignment Search Tool is 
used to search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 
(1993) and Altschul S.F. et al. J. Mol. Biol. 21:403-410 (1990)). Alternatively a FASTA 
version 3 search against Genpept, using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins 
25 are also provided by the present invention. Species homologs may be isolated and 

identified by making suitable probes or primers from the sequences provided herein and 
screening a suitable nucleic acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides 
or proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide 
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which also encode proteins which are identical, homologous or related to that encoded 
by the polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences 
which encode variants of the described nucleic acids. These amino acid sequence 
variants may be prepared by methods known in the art by introducing appropriate 
nucleotide changes into a native or variant polynucleotide. There are two variables in 
the construction of amino acid sequence variants: the location of the mutation and the 
nature of the mutation. Nucleic acids encoding the amino acid sequence variants are 
preferably constructed by mutating the polynucleotide to encode an amino acid 
sequence that does not occur in nature. These nucleic acid alterations can be made at 
sites that differ in the nucleic acids from different species (variable positions) or in 
highly conserved regions (constant regions). Sites at such locations will typically be 
modified in series, e.g., by substituting first with conservative choices (e.g., 
hydrophobic amino acid to a different hydrophobic amino acid) and then with more 
distant choices (e.g. , hydrophobic amino acid to a charged amino acid), and then 
deletions or insertions may be made at the target site. Amino acid sequence deletions 
generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are 
typically contiguous. Amino acid insertions include amino- and/or carboxyl-terminal 
fusions ranging in length from one to one hundred or more residues, as well as 
intrasequence insertions of single or multiple amino acid residues. Intrasequence 
insertions may range generally from about 1 to 10 amino residues, preferably from 1 to 
5 residues. Examples of terminal insertions include the heterologous signal sequences 
necessary for secretion or for intracellular targeting in different host cells and 
sequences such as FLAG or poly-histidine sequences useful for purifying the expressed 
protein, 

In a preferred method, polynucleotides encoding the novel amino acid sequences 
are changed via site-directed mutagenesis. This method uses oligonucleotide sequences 
to alter a polynucleotide to encode the desired amino acid variant, as well as sufficient 
adjacent nucleotides on both sides of the changed amino acid to form a stable duplex on 
either side of the site of being changed. In general, the techniques of site-directed 
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mutagenesis are well known to those of skill in the art and this technique is exemplified 
by publications such as, Edelman et ai.,DNA 2:183 (1983). A versatile and efficient 
method for producing site-specific changes in a polynucleotide sequence was published 
by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 (1982). PCR may also be used 
to create amino acid sequence variants of the novel nucleic acids. When small amounts 
of template DNA are used as starting material, primer(s) that differs slightly in 
sequence from the corresponding region in the template DNA can generate the desired 
amino acid variant. PCR amplification results in a population of product DNA 
fragments that differ from the polynucleotide template encoding the polypeptide at the 
position specified by the primer. The product DNA fragments replace the 
corresponding region in the plasmid and this gives a polynucleotide encoding the 
desired amino acid variant. 

A further technique for generating amino acid variants is the cassette 
mutagenesis technique described in Wells et aL, Gene 34:315 (1985); and other 
mutagenesis techniques well known in the art, such as, for example, the techniques in 
Sambrook et aL, supra, and Current Protocols in Molecular Biology, Ausubel et al. 
Due to the inherent degeneracy of the genetic code, other DNA sequences which 
encode substantially the same or a functionally equivalent amino acid sequence may be 
used in the practice of the invention for the cloning and expression of these novel 
nucleic acids. Such DNA sequences include those which are capable of hybridizing to 
the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can 
be used to generate polynucleotides encoding chimeric or fusion proteins comprising 
one or more domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any 
of the polynucleotides recited above. The polynucleotide can be DNA (genomic, 
cDNA, amplified, or synthetic) or RNA. Methods and algorithms for obtaining such 
polynucleotides are well known to those of skill in the art and can include, for example, 
methods for determining hybridization conditions that can routinely isolate 
polynucleotides of the desired sequence identities. 
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In accordance with the invention, polynucleotide sequences comprising the 
mature protein coding sequences corresponding to any one of SEQ ID NO: 1-419, or 
functional equivalents thereof, may be used to generate recombinant DNA molecules 
that direct the expression of that nucleic acid, or a functional equivalent thereof, in 
5 appropriate host cells. Also included are the cDNA inserts of any of the clones 
identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of 
other nucleotide sequences by well-established recombinant DNA techniques (see 
Sambrook J et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring 
10 Harbor Laboratory, NY). Useful nucleotide sequences for joining to polynucleotides 
include an assortment of vectors, e.g., plasmids, cosmids, lambda phage derivatives, 
^ phagemids, and the like, that are well known in the art. Accordingly, the invention 
w also provides a vector including a polynucleotide of the invention and a host cell 

if •"! 

if [J containing the polynucleotide. In general, the vector contains an origin of replication 
p 15 functional in at least one organism, convenient restriction endonuclease sites, and a 
f « selectable marker for the host cell. Vectors according to the invention include 
» expression vectors, replication vectors, probe generation vectors, and sequencing 

Q vectors. A host cell according to the invention can be a prokaryotic or eukaryotic cell 
^ and can be a unicellular organism or part of a multicellular organism. 
O20 The present invention further provides recombinant constructs comprising a 

nucleic acid having any of the nucleotide sequences of the SEQ ID NOs: 1 - 419 or a 
fragment thereof or any other polynucleotides of the invention. In one embodiment, the 
recombinant constructs of the present invention comprise a vector, such as a plasmid or 
viral vector, into which a nucleic acid having any of the nucleotide sequences of the 
25 SEQ ID NOs: 1 - 419 or a fragment thereof is inserted, in a forward or reverse 
orientation. In the case of a vector comprising one of the ORFs of the present 
invention, the vector may further comprise regulatory sequences, including for 
example, a promoter, operably linked to the ORF. Large numbers of suitable vectors 
and promoters are known to those of skill in the art and are commercially available for 
30 generating the recombinant constructs of the present invention. The following vectors 
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are provided by way of example. Bacterial: pBs, phagescript, PsiX174, pBluescript 
SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, 
pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, pOG44, 
PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an 
expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et al, Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the 
protein recombinantly. Many suitable expression control sequences are known in the 
art. General methods of expressing recombinant proteins are also known and are 
exemplified in R. Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined 
herein "operably linked" means that the isolated polynucleotide of the invention and an 
expression control sequence are situated within a vector or cell in such a way that the 
protein is expressed by a host cell which has been transformed (transfected) with the 
ligated polynucleotide/expression control sequence. 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include 
CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from 
retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and 
promoter is well within the level of ordinary skill in the art. Generally, recombinant 
expression vectors will include origins of replication and selectable markers permitting 
transformation of the host cell, e.g. , the ampicillin resistance gene of E. coli and S. 
cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct 
transcription of a downstream structural sequence. Such promoters can be derived 
from operons encoding glycolytic enzymes such as 3 -phosphogly cerate kinase (PGK), 
a-factor, acid phosphatase, or heat shock proteins, among others. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and 
termination sequences, and preferably, a leader sequence capable of directing secretion 
of translated protein into the periplasmic space or extracellular medium. Optionally, 
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the heterologous sequence can encode a fusion protein including an amino terminal 
identification peptide imparting desired characteristics, e.g. , stabilization or simplified 
purification of expressed recombinant product. Useful expression vectors for bacterial 
use are constructed by inserting a structural DNA sequence encoding a desired protein 
5 together with suitable translation initiation and termination signals in operable reading 
phase with a functional promoter. The vector will comprise one or more phenotypic 
selectable markers and an origin of replication to ensure maintenance of the vector and 
to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various 
10 species within the genera Pseudomonas, Streptomyces , and Staphylococcus, although 
others may also be employed as a matter of choice. 
r . As a representative but non-limiting example, useful expression vectors for 

2 j bacterial use can comprise a selectable marker and bacterial origin of replication 

derived from commercially available plasmids comprising genetic elements of the well 
t;[ 15 known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for 
1 1 example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 
* (Promega Biotech, Madison, WI, USA). These pBR322 "backbone" sections are 

: j combined with an appropriate promoter and the structural sequence to be expressed. 
^; Following transformation of a suitable host strain and growth of the host strain to an 
i ; 20 appropriate cell density, the selected promoter is induced or derepressed by appropriate 
means {e.g., temperature shift or chemical induction) and cells are cultured for an 
additional period. Cells are typically harvested by centrifugation, disrupted by physical 
or chemical means, and the resulting crude extract retained for further purification. 

Polynucleotides of the invention can also be used to induce immune responses. 
25 For example, as described in Fan et al., Nat. Biotech. 17:870-872 (1999), incorporated 
herein by reference, nucleic acid sequences encoding a polypeptide may be used to 
generate antibodies against the encoded polypeptide following topical administration of 
naked plasmid DNA or following injection, and preferably intramuscular injection of 
the DNA. The nucleic acid sequences are preferably inserted in a recombinant 
30 expression vector and may be in the form of naked DNA. 
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4.3 HOSTS 

The present invention further provides host cells genetically engineered to 
contain the polynucleotides of the invention. For example, such host cells may contain 
5 nucleic acids of the invention introduced into the host cell using known transformation, 
transfection or infection methods. The present invention still further provides host cells 
genetically engineered to express the polynucleotides of the invention, wherein such 
polynucleotides are in operative association with a regulatory sequence heterologous to 
the host cell which drives expression of the polynucleotides in the cell. 
10 Knowledge of nucleic acid sequences allows for modification of cells to permit, 

or increase, expression of endogenous polypeptide. Cells can be modified (e.g. , by 
n homologous recombination) to provide increased polypeptide expression by replacing, 
j in whole or in part, the naturally occurring promoter with all or part of a heterologous 
) if promoter so that the cells express the polypeptide at higher levels. The heterologous 
i, .} 15 promoter is inserted in such a manner that it is operatively linked to the encoding 
| |;f sequences. See, for example, PCT International Publication No. WO94/12650, PCT 

V i 

I International Publication No. WO92/20808, and PCT International Publication No. 

%j W09 1/09955. It is also contemplated that, in addition to heterologous promoter DNA, 
J* amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
O20 encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) 
and/or intron DNA may be inserted along with the heterologous promoter DNA. If 
linked to the coding sequence, amplification of the marker DNA by standard selection 
methods results in co-amplification of the desired protein coding sequences in the cells. 
The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a 
25 lower eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic 
cell, such as a bacterial cell. Introduction of the recombinant construct into the host 
cell can be effected by calcium phosphate transfection, DEAE, dextran mediated 
transfection, or electroporation (Davis, L. et al., Basic Methods in Molecular Biology 
(1986)). The host cells containing one of the polynucleotides of the invention, can be 
30 used in conventional manners to produce the gene product encoded by the isolated 
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fragment (in the case of an ORF) or can be used to produce a heterologous protein 
under the control of the EMF. 

Any host/vector system can be used to express one or more of the ORFs of the 
present invention. These include, but are not limited to, eukaryotic hosts such as HeLa 
cells, Cv-1 cell, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as 
E. coli and B, subtilis. The most preferred cells are those which do not normally 
express the particular polypeptide or protein or which expresses the polypeptide or 
protein at low natural level. Mature proteins can be expressed in mammalian cells, 
yeast, bacteria, or other cells under the control of appropriate promoters. Cell-free 
translation systems can also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the present invention. Appropriate cloning and expression 
vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et 
al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring 
Harbor, New York (1989), the disclosure of which is hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 
lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981). Other 
cell lines capable of expressing a compatible vector are, for example, the C127, 
monkey COS cells, Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, 
human epidermal A431 cells, human Colo205 cells, 3T3 cells, CV-1 cells, other 
transformed primate cell lines, normal diploid cells, cell strains derived from in vitro 
culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, HL-60, 
U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of 
replication, a suitable promoter and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, transcriptional termination 
sequences, and 5' flanking nontranscribed sequences. DNA sequences derived from 
the SV40 viral genome, for example, SV40 origin, early promoter, enhancer, splice, 
and polyadenylation sites may be used to provide the required nontranscribed genetic 
elements. Recombinant polypeptides and proteins produced in bacterial culture are 
usually isolated by initial extraction from cell pellets, followed by one or more 
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salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature 
protein. Finally, high performance liquid chromatography (HPLC) can be employed 
for final purification steps. Microbial cells employed in expression of proteins can be 
disrupted by any convenient method, including fireeze-thaw cycling, sonication, 
mechanical disruption, or use of cell lysing agents. 

Alternatively, it may be possible to produce the protein in lower eukaryotes such 
as yeast or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains 
include Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, 
Candida, or any yeast strain capable of expressing heterologous proteins. Potentially 
suitable bacterial strains include Escherichia coli, Bacillus subtilis, Salmonella 
typhimurium, or any bacterial strain capable of expressing heterologous proteins. If the 
protein is made in yeast or bacteria, it may be necessary to modify the protein produced 
therein, for example by phosphorylation or glycosylation of the appropriate sites, in 
order to obtain the functional protein. Such covalent attachments may be accomplished 
using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be 
engineered to express an endogenous gene comprising the polynucleotides of the 
invention under the control of inducible regulatory elements, in which case the 
regulatory sequences of the endogenous gene may be replaced by homologous 
recombination. As described herein, gene targeting can be used to replace a gene's 
existing regulatory region with a regulatory sequence isolated from a different gene or a 
novel regulatory sequence synthesized by genetic engineering methods. Such 
regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment 
regions, negative regulatory elements, transcriptional initiation sites, regulatory protein 
binding sites or combinations of said sequences. Alternatively, sequences which affect 
the structure or stability of the RNA or protein produced may be replaced, removed, 
added, or otherwise modified by targeting. These sequence include polyadenylation 
signals, mRNA stability elements, splice sites, leader sequences for enhancing or 
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modifying transport or secretion properties of the protein, or other sequences which 
alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, 
placing the gene under the control of the new regulatory sequence, e.g., inserting a new 
5 promoter or enhancer or both upstream of a gene. Alternatively, the targeting event 
may be a simple deletion of a regulatory element, such as the deletion of a 
tissue-specific negative regulatory element. Alternatively, the targeting event may 
replace an existing element; for example, a tissue-specific enhancer can be replaced by 
an enhancer that has broader or different cell-type specificity than the naturally 
10 occurring elements. Here, the naturally occurring sequences are deleted and new 
sequences are added. In all cases, the identification of the targeting event may be 
f facilitated by the use of one or more selectable marker genes that are contiguous with 

the targeting DNA, allowing for the selection of cells in which the exogenous DNA has 
: : j integrated into the host cell genome. The identification of the targeting event may also 
n 15 be facilitated by the use of one or more marker genes exhibiting the property of 
I • ; negative selection, such that the negatively selectable marker is linked to the exogenous 
u DNA, but configured such that the negatively selectable marker flanks the targeting 

I sequence, and such that a correct homologous recombination event with sequences in 
; r ? the host cell genome does not result in the stable integration of the negatively selectable 
; j 20 marker. Markers useful for this purpose include the Herpes Simplex Virus thymidine 
kinase (TK) gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) 
gene. 

The gene targeting or gene activation techniques which can be used in 
accordance with this aspect of the invention are more particularly described in U.S. 
25 Patent No. 5,272,071 to Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; 

International Application No. PCT/US92/09627 (WO93/09222) by Selden et al.; and 
International Application No. PCT/US90/06436 (WO91/06667) by Skoultchi et al., 
each of which is incorporated by reference herein in its entirety. 

30 4.4 POLYPEPTIDES OF THE INVENTION 
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The isolated polypeptides of the invention include, but are not limited to, a 
polypeptide comprising: the amino acid sequences set forth as any one of SEQ ID NO: 
1-419 or an amino acid sequence encoded by any one of the nucleotide sequences SEQ 
ID NOs: 1 - 419 or the corresponding full length or mature protein. Polypeptides of the 
invention also include polypeptides preferably with biological or immunological activity 
that are encoded by: (a) a polynucleotide having any one of the nucleotide sequences set 
forth in the SEQ ID NOs: 1 - 419 or (b) polynucleotides encoding any one of the amino 
acid sequences set forth as SEQ ID NO: 1-419 or (c) polynucleotides that hybridize to 
the complement of the polynucleotides of either (a) or (b) under stringent hybridization 
conditions. The invention also provides biologically active or immunologically active 
variants of any of the amino acid sequences set forth as SEQ ID NO: 1-419 or the 
corresponding full length or mature protein; and "substantial equivalents" thereof 
(e.g., with at least about 65%, at least about 70%, at least about 75%, at least about 
80%, at least about 85%, at least about 90%, typically at least about 95%, more 
typically at least about 98%, or most typically at least about 99% amino acid identity) 
that retain biological activity. Polypeptides encoded by allelic variants may have a 
similar, increased, or decreased activity compared to polypeptides comprising SEQ ID 
NO: 1-419. 

Fragments of the proteins of the present invention which are capable of 
exhibiting biological activity are also encompassed by the present invention. Fragments 
of the protein may be in linear form or they may be cyclized using known methods, for 
example, as described in H. U. Saragovi, et aL, Bio/Technology 10, 773-778 (1992) 
and in R. S. McDowell, et aL, J. Amer. Chem. Soc. 114, 9245-9253 (1992), both of 
which are incorporated herein by reference. Such fragments may be fused to carrier 
molecules such as immunoglobulins for many purposes, including increasing the 
valency of protein binding sites. 

The present invention also provides both full-length and mature forms (for 
example, without a signal sequence or precursor sequence) of the disclosed proteins. 
The protein coding sequence is identified in the sequence listing by translation of the 
disclosed nucleotide sequences. The mature form of such protein may be obtained by 
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expression of a full-length polynucleotide in a suitable mammalian cell or other host 
cell. The sequence of the mature form of the protein is also determinable from the 
amino acid sequence of the full-length form. Where proteins of the present invention 
are membrane bound, soluble forms of the proteins are also provided. In such forms, 
5 part or all of the regions causing the proteins to be membrane bound are deleted so that 
the proteins are fully secreted from the cell in which it is expressed. 

Protein compositions of the present invention may further comprise an 
acceptable carrier, such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 
The present invention further provides isolated polypeptides encoded by the 
10 nucleic acid fragments of the present invention or by degenerate variants of the nucleic 
acid fragments of the present invention. By "degenerate variant" is intended nucleotide 
fragments which differ from a nucleic acid fragment of the present invention (e.g. , an 
ORF) by nucleotide sequence but, due to the degeneracy of the genetic code, encode an 
identical polypeptide sequence. Preferred nucleic acid fragments of the present 
;\ 15 invention are the ORFs that encode proteins. 

A variety of methodologies known in the art can be utilized to obtain any one of 
the isolated polypeptides or proteins of the present invention. At the simplest level, the 
amino acid sequence can be synthesized using commercially available peptide 
synthesizers. The synthetically-constructed protein sequences, by virtue of sharing 
120 primary, secondary or tertiary structural and/or conformational characteristics with 
proteins may possess biological properties in common therewith, including protein 
activity. This technique is particularly useful in producing small peptides and 
fragments of larger polypeptides. Fragments are useful, for example, in generating 
antibodies against the native polypeptide. Thus, they may be employed as biologically 
25 active or immunological substitutes for natural, purified proteins in screening of 
therapeutic compounds and in immunological processes for the development of 
antibodies. 

The polypeptides and proteins of the present invention can alternatively be 
purified from cells which have been altered to express the desired polypeptide or 
30 protein. As used herein, a cell is said to be altered to express a desired polypeptide or 
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protein when the cell, through genetic manipulation, is made to produce a polypeptide 
or protein which it normally does not produce or which the cell normally produces at a 
lower level. One skilled in the art can readily adapt procedures for introducing and 
expressing either recombinant or synthetic sequences into eukaryotic or prokaryotic 
cells in order to generate a cell which produces one of the polypeptides or proteins of 
the present invention. 

The invention also relates to methods for producing a polypeptide comprising 
growing a culture of host cells of the invention in a suitable culture medium, and 
purifying the protein from the cells or the culture in which the cells are grown. For 
example, the methods of the invention include a process for producing a polypeptide in 
which a host cell containing a suitable expression vector that includes a polynucleotide 
of the invention is cultured under conditions that allow expression of the encoded 
polypeptide. The polypeptide can be recovered from the culture, conveniently from the 
culture medium, or from a lysate prepared from the host cells and further purified. 
Preferred embodiments include those in which the protein produced by such process is 
a full length or mature form of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial 
cells which naturally produce the polypeptide or protein. One skilled in the art can 
readily follow known methods for isolating polypeptides and proteins in order to obtain 
one of the isolated polypeptides or proteins of the present invention. These include, but 
are not limited to, immunochromatography, HPLC, size-exclusion chromatography, 
ion-exchange chromatography, and immuno-affinity chromatography. See, e.g. , 
Scopes, Protein Purification: Principles and Practice, Springer-Verlag (1994); 
Sambrook, et al., in Molecular Cloning: A Laboratory Manual; Ausubel et al., 
Current Protocols in Molecular Biology. Polypeptide fragments that retain 
biological/immunological activity include fragments comprising greater than about 100 
amino acids, or greater than about 200 amino acids, and fragments that encode specific 
protein domains. 

The purified polypeptides can be used in in vitro binding assays which are well 
known in the art to identify molecules which bind to the polypeptides. These molecules 
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include but are not limited to, for e.g., small molecules, molecules from combinatorial 
libraries, antibodies or other proteins. The molecules identified in the binding assay are 
then tested for antagonist or agonist activity in in vivo tissue culture or animal models 
that are well known in the art. In brief, the molecules are titrated into a plurality of 
5 cell cultures or animals and then tested for either cell/animal death or prolonged 
survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the 
peptides may be complexed with toxins, e.g., ricin or cholera, or with other compounds 
that are toxic to cells. The toxin-binding molecule complex is then targeted to a tumor 
10 or other cell by the specificity of the binding molecule for SEQ ID NO: 1-419. 

The protein of the invention may also be expressed as a product of transgenic 
i animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep 

; ' which are characterized by somatic or germ cells containing a nucleotide sequence 
s'Li encoding the protein. 

r ;15 The proteins provided herein also include proteins characterized by amino acid 

I sequences similar to those of purified proteins but into which modification are naturally 

provided or deliberately engineered. For example, modifications, in the peptide or 
.\ DNA sequence, can be made by those skilled in the art using known techniques. 
; I Modifications of interest in the protein sequences may include the alteration, 
( 20 substitution, replacement, insertion or deletion of a selected amino acid residue in the 
coding sequence. For example, one or more of the cysteine residues may be deleted or 
replaced with another amino acid to alter the conformation of the molecule. 
Techniques for such alteration, substitution, replacement, insertion or deletion are well 
known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such 
25 alteration, substitution, replacement, insertion or deletion retains the desired activity of 
the protein. Regions of the protein that are important for the protein function can be 
determined by various methods known in the art including the alanine-scanning method 
which involved systematic substitution of single or strings of amino acids with alanine, 
followed by testing the resulting alanine-containing variant for biological activity. This 
30 type of analysis determines the importance of the substituted amino acid(s) in biological 
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activity. Regions of the protein that are important for protein function may be 
determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be 
expected to retain protein activity in whole or in part and are useful for screening or 
5 other immunological methodologies may also be easily made by those skilled in the art 
given the disclosures herein. Such modifications are encompassed by the present 
invention. 

The protein may also be produced by operably linking the isolated 
polynucleotide of the invention to suitable control sequences in one or more insect 
10 expression vectors, and employing an insect expression system. Materials and methods 
for baculovirus/insect cell expression systems are commercially available in kit form 
. from, e.g., Invitrogen, San Diego, Calif., U.S.A. (the MaxBat™ kit), and such methods 

are well known in the art, as described in Summers and Smith, Texas Agricultural 
j U Experiment Station Bulletin No. 1555 (1987), incorporated herein by reference. As 
J "j 115 used herein, an insect cell capable of expressing a polynucleotide of the present 

is ; s 

\ " ■ invention is "transformed. " 

; The protein of the invention may be prepared by culturing transformed host cells 

under culture conditions suitable to express the recombinant protein. The resulting 
IJ expressed protein may then be purified from such culture (i.e. , from culture medium or 

) ; ;B0 cell extracts) using known purification processes, such as gel filtration and ion 

exchange chromatography. The purification of the protein may also include an affinity 
column containing agents which will bind to the protein; one or more column steps over 
such affinity resins as concanavalin A-agarose, heparin-toyopearl™ or Cibacrom blue 
3GA Sepharose™; one or more steps involving hydrophobic interaction chromatography 
25 using such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity 
chromatography. 

Alternatively, the protein of the invention may also be expressed in a form 
which will facilitate purification. For example, it may be expressed as a fusion protein, 
such as those of maltose binding protein (MBP), glutathione-S-transferase (GST) or 
30 thioredoxin (TRX), or as a His tag. Kits for expression and purification of such fusion 
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proteins are commercially available from New England BioLab (Beverly, Mass,), 
Pharmacia (Piscataway, N.J.) and Invitrogen, respectively. The protein can also be 
tagged with an epitope and subsequently purified by using a specific antibody directed 
to such epitope. One such epitope ("FLAG® ") is commercially available from Kodak 
5 (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography 
(RP- HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having 
pendant methyl or other aliphatic groups, can be employed to further purify the protein. 
Some or all of the foregoing purification steps, in various combinations, can also be 
10 employed to provide a substantially homogeneous isolated recombinant protein. The 
protein thus purified is substantially free of other mammalian proteins and is defined in 
fS% accordance with the present invention as an "isolated protein. " 
U The polypeptides of the invention include analogs (variants). This embraces 

fU fragments, as well as peptides in which one or more amino acids has been deleted, 
l ]15 inserted, or substituted. Also, analogs of the polypeptides of the invention embrace 
j jj fusions of the polypeptides or modifications of the polypeptides of the invention, 

wherein the polypeptide or analog is fused to another moiety or moieties, e.g. , targeting 
kj moiety or another therapeutic agent. Such analogs may exhibit improved properties 
J i: such as activity and/or stability. Examples of moieties which may be fused to the 
1/20 polypeptide or an analog include, for example, targeting moieties which provide for the 
delivery of polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells, 
antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, 
etc., as well as receptor and ligands expressed on pancreatic or immune cells. Other 
moieties which may be fused to the polypeptide include therapeutic agents which are 
25 used for treatment, for example, immunosuppressive drugs such as cyclosporin, SK506, 
azathioprine, CD3 antibodies and steroids. Also, polypeptides may be fused to immune 
modulators, and other cytokines such as alpha or beta interferon. 



4.4.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE 
30 IDENTITY AND SIMILARITY 
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Preferred identity and/or similarity are designed to give the largest match 
between the sequences tested. Methods to determine identity and similarity are codified 
in computer programs including, but are not limited to, the GCG program package, 
including GAP (Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); 
Genetics Computer Group, University of Wisconsin, Madison, WI), BLASTP, 
BLASTN, BLASTX, FASTA (Altschul, S.F. et al., J. Molec. Biol. 215:403-410 
(1990), PSI-BLAST (Altschul S.F. et al., Nucleic Acids Res. vol. 25, pp. 3389-3402, 
herein incorporated by reference), eMatrix software (Wu et al., J. Comp. Biol., vol. 6, 
pp. 219-235 (1999), herein incorporated by reference), eMotif software (Nevill- 
Manning et al, ISMB-97, vol 4, pp. 202-209, herein incorporated by reference) and the 
Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 105-31 
(1982), incorporated herein by reference). The BLAST programs are publicly available 
from the National Center for Biotechnology Information (NCBI) and other sources 
(BLAST Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, 
S., et al., J. Mol. Biol. 215:403-410 (1990). 

4.5 GENE THERAPY 

Mutations in the polynucleotides of the invention gene may result in loss of 
normal function of the encoded protein. The invention thus provides gene therapy to 
restore normal activity of the polypeptides of the invention; or to treat disease states 
involving polypeptides of the invention. Delivery of a functional gene encoding 
polypeptides of the invention to appropriate cells is effected ex vivo, in situ, or in vivo 
by use of vectors, and more particularly viral vectors (e.g., adenovirus, 
adeno-associated virus, or a retrovirus), or ex vivo by use of physical DNA transfer 
methods (e.g., liposomes or chemical treatments). See, for example, Anderson, Nature, 
supplement to vol. 392, no. 6679, pp. 25-20 (1998). For additional reviews of gene 
therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific 
American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any 
one of the nucleotides of the present invention or a gene encoding the polypeptides of 
the present invention can also be accomplished with extrachromosomal substrates 
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(transient expression) or artificial chromosomes (stable expression). Cells may also be 
cultured ex vivo in the presence of proteins of the present invention in order to 
proliferate or to produce a desired effect on or activity in such cells. Treated cells can 
then be introduced in vivo for therapeutic purposes. Alternatively, it is contemplated 
5 that in other human disease states, preventing the expression of or inhibiting the activity 
of polypeptides of the invention will be useful in treating the disease states. It is 
contemplated that antisense therapy or gene therapy could be applied to negatively 
regulate the expression of polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of 
10 antisense molecules to the nucleic acids of the present invention, their complements, or 
their translated RNA sequences, by methods known in the art. Further, the polypeptides 
of the present invention can be inhibited by using targeted deletion methods, or the 
insertion of a negative regulatory element such as a silencer, which is tissue specific. 
! ;j The present invention still further provides cells genetically engineered in vivo to 

; J15 express the polynucleotides of the invention, wherein such polynucleotides are in operative 
I association with a regulatory sequence heterologous to the host cell which drives 

expression of the polynucleotides in the cell. These methods can be used to increase or 
] decrease the expression of the polynucleotides of the present invention. 
^ Knowledge of DNA sequences provided by the invention allows for modification 

^20 of cells to permit, increase, or decrease, expression of endogenous polypeptide. Cells can 
be modified (e.g., by homologous recombination) to provide increased polypeptide 
expression by replacing, in whole or in part, the naturally occurring promoter with all or 
part of a heterologous promoter so that the cells express the protein at higher levels. The 
heterologous promoter is inserted in such a manner that it is operatively linked to the 
25 desired protein encoding sequences. See, for example, PCT International Publication No. 
WO 94/12650, PCT International Publication No. WO 92/20808, and PCT International 
Publication No. WO 91/09955. It is also contemplated that, in addition to heterologous 
promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD 
gene which encodes carbamyl phosphate synthase, aspartate transcarbamylase, and 
30 dihydroorotase) and/or intron DNA may be inserted along with the heterologous promoter 
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DNA. If linked to the desired protein coding sequence, amplification of the marker DNA 
by standard selection methods results in co-amplification of the desired protein coding 
sequences in the cells. 

In another embodiment of the present invention, cells and tissues may be 
engineered to express an endogenous gene comprising the polynucleotides of the invention 
under the control of inducible regulatory elements, in which case the regulatory sequences 
of the endogenous gene may be replaced by homologous recombination. As described 
herein, gene targeting can be used to replace a gene's existing regulatory region with a 
regulatory sequence isolated from a different gene or a novel regulatory sequence 
synthesized by genetic engineering methods. Such regulatory sequences may be 
comprised of promoters, enhancers, scaffold-attachment regions, negative regulatory 
elements, transcriptional initiation sites, regulatory protein binding sites or combinations 
of said sequences. Alternatively, sequences which affect the structure or stability of the 
RNA or protein produced may be replaced, removed, added, or otherwise modified by 
targeting. These sequences include polyadenylation signals, mRNA stability elements, 
splice sites, leader sequences for enhancing or modifying transport or secretion properties 
of the protein, or other sequences which alter or improve the function or stability of 
protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing 
the gene under the control of the new regulatory sequence, e.g., inserting a new promoter 
or enhancer or both upstream of a gene. Alternatively, the targeting event may be a 
simple deletion of a regulatory element, such as the deletion of a tissue-specific negative 
regulatory element. Alternatively, the targeting event may replace an existing element; 
for example, a tissue-specific enhancer can be replaced by an enhancer that has broader or 
different cell-type specificity than the naturally occurring elements. Here, the naturally 
occurring sequences are deleted and new sequences are added. In all cases, the 
identification of the targeting event may be facilitated by the use of one or more selectable 
marker genes that are contiguous with the targeting DNA, allowing for the selection of 
cells in which the exogenous DNA has integrated into the cell genome. The identification 
of the targeting event may also be facilitated by the use of one or more marker genes 
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exhibiting the property of negative selection, such that the negatively selectable marker is 
linked to the exogenous DNA, but configured such that the negatively selectable marker 
flanks the targeting sequence, and such that a correct homologous recombination event 
with sequences in the host cell genome does not result in the stable integration of the 
5 negatively selectable marker. Markers useful for this purpose include the Herpes Simplex 
Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance 
with this aspect of the invention are more particularly described in U.S. Patent No. 
10 5,272,071 to Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International 

Application No. PCT/US92/09627 (WO93/09222) by Selden et al.; and International 
Application No. PCT/US90/06436 (WO91/06667) by Skoultchi et al. , each of which is 
incorporated by reference herein in its entirety. 

L !l5 4.6 TRANSGENIC ANIMALS 

^; • In preferred methods to determine biological functions of the polypeptides of the 

invention in vivo, one or more genes provided by the invention are either over 
V'\ expressed or inactivated in the germ line of animals using homologous recombination 
' [Capecchi, Science 244:1288-1292 (1989)]. Animals in which the gene is over 
s 720 expressed, under the regulatory control of exogenous or endogenous promoter 

elements, are known as transgenic animals. Animals in which an endogenous gene has 
been inactivated by homologous recombination are referred to as "knockout" animals. 
Knockout animals, preferably non-human mammals, can be prepared as described in 
U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic animals are 
25 useful to determine the roles polypeptides of the invention play in biological processes, 
and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metabolism. Transgenic animals, preferably 
non-human mammals, are produced using methods as described in U.S. Patent No 
5,489,743 and PCT Publication No. W094/28122, incorporated herein by reference. 
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Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or inactivated to alter the level of 
expression of the polypeptides of the invention. Inactivation can be carried out using 
homologous recombination methods described above. Activation can be achieved by 
supplementing or even replacing the homologous promoter to provide for increased 
protein expression. The homologous promoter can be supplemented by insertion of one 
or more heterologous enhancer elements known to confer promoter activation in a 
particular tissue. 

The polynucleotides of the present invention also make possible the 
development, through, e.g., homologous recombination or knock out strategies, of 
animals that fail to express polypeptides of the invention or that express a variant 
polypeptide. Such animals are useful as models for studying the in vivo activities of 
polypeptide as well as for studying modulators of the polypeptides of the invention. 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over 
expressed or inactivated in the germ line of animals using homologous recombination 
[Capecchi, Science 244:1288-1292 (1989)]. Animals in which the gene is over 
expressed, under the regulatory control of exogenous or endogenous promoter 
elements, are known as transgenic animals. Animals in which an endogenous gene has 
been inactivated by homologous recombination are referred to as "knockout" animals. 
Knockout animals, preferably non-human mammals, can be prepared as described in 
U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic animals are 
useful to determine the roles polypeptides of the invention play in biological processes, 
and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metabolism. Transgenic animals, preferably 
non-human mammals, are produced using methods as described in U.S. Patent No 
5,489,743 and PCT Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of 
the invention promoter is either activated or inactivated to alter the level of expression 
of the polypeptides of the invention. Inactivation can be carried out using homologous 
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recombination methods described above. Activation can be achieved by supplementing 
or even replacing the homologous promoter to provide for increased protein expression. 
The homologous promoter can be supplemented by insertion of one or more 
heterologous enhancer elements known to confer promoter activation in a particular 
5 tissue. 

4.7 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit 
one or more of the uses or biological activities (including those associated with assays 

10 cited herein) identified herein. Uses or activities described for proteins of the present 
invention may be provided by administration or use of such proteins or of 
polynucleotides encoding such proteins (such as, for example, in gene therapies or 
vectors suitable for introduction of DNA). The mechanism underlying the particular 
condition or pathology will dictate whether the polypeptides of the invention, the 

15 polynucleotides of the invention or modulators (activators or inhibitors) thereof would 
be beneficial to the subject in need of treatment. Thus, "therapeutic compositions of 
the invention" include compositions comprising isolated polynucleotides (including 
recombinant DNA molecules, cloned genes and degenerate variants thereof) or 
polypeptides of the invention (including full length protein, mature protein and 

20 truncations or domains thereof), or compounds and other substances that modulate the 
overall activity of the target gene products, either at the level of target gene/protein 
expression or target protein activity. Such modulators include polypeptides, analogs, 
(variants), including fragments and fusion proteins, antibodies and other binding 
proteins; chemical compounds that directly or indirectly activate or inhibit the 

25 polypeptides of the invention (identified, e.g., via drug screening assays as described 
herein); antisense polynucleotides and polynucleotides suitable for triple helix 
formation; and in particular antibodies or other binding partners that specifically 
recognize one or more epitopes of the polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular 

30 activation or in one of the other physiological pathways described herein. 



787CIP2B 



41 



4.7.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the 
research community for various purposes. The polynucleotides can be used to express 
5 recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constitutively or at a particular stage of tissue differentiation or development or in 
disease states); as molecular weight markers on gels; as chromosome markers or tags 
(when labeled) to identify chromosomes or to map related gene positions; to compare 

10 with endogenous DNA sequences in patients to identify potential genetic disorders; as 
probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to 
"subtract-out" known sequences in the process of discovering other novel 
polynucleotides; for selecting and making oligomers for attachment to a "gene chip" or 

15 other support, including for examination of expression patterns; to raise anti-protein 
antibodies using DNA immunization techniques; and as an antigen to raise anti-DNA 
antibodies or elicit another immune response. Where the polynucleotide encodes a 
protein which binds or potentially binds to another protein (such as, for example, in a 
receptor-ligand interaction), the polynucleotide can also be used in interaction trap 

20 assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to 
identify polynucleotides encoding the other protein with which binding occurs or to 
identify inhibitors of the binding interaction. 

The polypeptides provided by the present invention can similarly be used in 
assays to determine biological activity, including in a panel of multiple proteins for 

25 high-throughput screening; to raise antibodies or to elicit another immune response; as 
a reagent (including the labeled reagent) in assays designed to quantitatively determine 
levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 
the corresponding polypeptide is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 

30 course, to isolate correlative receptors or ligands. Proteins involved in these binding 
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interactions can also be used to screen for peptide or small molecule inhibitors or 
agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent 
grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include without limitation "Molecular 
Cloning: A Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, 
Sambrook, J., E. F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: 
Guide to Molecular Cloning Techniques", Academic Press, Berger, S. L. and A. R. 
Kimmel eds., 1987. 

4.7.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as 
nutritional sources or supplements. Such uses include without limitation use as a protein 
or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a 
source of carbohydrate. In such cases the polypeptide or polynucleotide of the invention 
can be added to the feed of a particular organism or can be administered as a separate 
solid or liquid preparation, such as in the form of powder, pills, solutions, suspensions or 
capsules. In the case of microorganisms, the polypeptide or polynucleotide of the 
invention can be added to the medium in or on which the microorganism is cultured. 

4.7.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, 
cell proliferation (either inducing or inhibiting) or cell differentiation (either inducing or 
inhibiting) activity or may induce production of other cytokines in certain cell 
populations. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Many protein factors discovered to date, including all known cytokines, 
have exhibited activity in one or more factor-dependent cell proliferation assays, and 
hence the assays serve as a convenient confirmation of cytokine activity. The activity of 
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therapeutic compositions of the present invention is evidenced by any one of a number 
of routine factor dependent cell proliferation assays for cell lines including, without 
limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, MC9/G, M+(preB M+), 2E8, 
RB5, DAI, 123, T1165, HT2, CTLL2, TF-1, Mo7e, CMK, HUVEC, and Caco. 
Therapeutic compositions of the invention can be used in the following: 

Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. 
Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte 
Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et al., J. 
Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; 
Bertagnollietal., Cellular Immunology 133:327-341, 1991; Bertagnolli, etal., I. 
Immunol. 149:3778-3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node 
cells or thymocytes include, without limitation, those described in: Polyclonal T cell 
stimulation, Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, 
Toronto. 1994; and Measurement of mouse and human interleukin-y, Schreiber, R. D. 
In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, 
John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without limitation, those described in: Measurement of Human and 
Murine Interleukin 2 and Interleukin 4, Bottomry, K., Davis, L. S. and Lipsky, P. E. 
In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, 
John Wiley and Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 
1991; Moreau et al., Nature 336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. 
Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse and human interleukin 
6~Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 
6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. 
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin ll«Bennett, F., 
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Giannotti, J., Clark, S, C. and Turner, K, J. In Current Protocols in Immunology. J. 
E. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; Measurement 
of mouse and human Interleukin 9~Ciarletta, A., Giannotti, J., Clark, S. C. and 
Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 
6.13.1, John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among 
others, proteins that affect APC-T cell interactions as well as direct T-cell effects by 
measuring proliferation and cytokine production) include, without limitation, those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. 
Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte 
Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, Immunologic 
studies in Humans); Weinberger et aL, Proc. Natl. Acad. Sci. USA 77:6091-6095, 
1980; Weinberger et aL, Eur. J. Immun. 11:405-411, 1981; Takai et al., L Immunol. 
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988. 

4.7.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor 
activity and be involved in the proliferation, differentiation and survival of pluripotent 
and totipotent stem cells including primordial germ cells, embryonic stem cells, 
hematopoietic stem cells and/or germ line stem cells. Administration of the polypeptide 
of the invention to stem cells in vivo or ex vivo is expected to maintain and expand cell 
populations in a totipotential or pluripotential state which would be useful for re- 
engineering damaged or diseased tissues, transplantation, manufacture of bio- 
pharmaceuticals and the development of bio-sensors. The ability to produce large 
quantities of human cells has important working applications for the production of 
human proteins which currently must be obtained from non-human sources or donors, 
implantation of cells to treat diseases such as Parkinson's, Alzheimer's and other 
neurodegenerative diseases; tissues for grafting such as bone marrow, skin, cartilage, 
tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, 
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gastrointestinal cells and others; and organs for transplantation such as kidney, liver, 
pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or 
cytokines may be administered in combination with the polypeptide of the invention to 
achieve the desired effect, including any of the growth factors listed herein, other stem 
cell maintenance factors, and specifically including stem cell factor (SCF), leukemia 
inhibitory factor (LIF), Flt-3 ligand (Flt-3L), any of the interleukins, recombinant 
soluble IL-6 receptor fused to IL-6, macrophage inflammatory protein 1 -alpha (MIP-1- 
alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet factor 4 (PF-4), platelet- 
derived growth factor (PDGF), neural growth factors and basic fibroblast growth factor 
(bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, 
expansion of these cells in culture will facilitate the production of large quantities of 
mature cells. Techniques for culturing stem cells are known in the art and 
administration of polypeptides of the invention, optionally with other growth factors 
and/or cytokines, is expected to enhance the survival and proliferation of the stem cell 
populations. This can be accomplished by direct administration of the polypeptide of 
the invention to the culture medium. Alternatively, stroma cells transfected with a 
polynucleotide that encodes for the polypeptide of the invention can be used as a feeder 
layer for the stem cell populations in culture or in vivo. Stromal support cells for 
feeder layers may include embryonic bone marrow fibroblasts, bone marrow stromal 
cells, fetal liver cells, or cultured embryonic fibroblasts (see U.S. Patent No. 
5,690,926). 

Stem cells themselves can be transfected with a polynucleotide of the invention 
to induce autocrine expression of the polypeptide of the invention. This will allow for 
generation of undifferentiated totipotential/pluripotential stem cell lines that are useful 
as is or that can then be differentiated into the desired mature cell types. These stable 
cell lines can also serve as a source of undifferentiated totipotential/pluripotential 
mRNA to create cDNA libraries and templates for polymerase chain reaction 
experiments. These studies would allow for the isolation and identification of 
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differentially expressed genes in stem cell populations that regulate stem cell 
proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in 
the treatment of many pathological conditions. For example, polypeptides of the 
present invention may be used to manipulate stem cells in culture to give rise to 
neuroepithelial cells that can be used to augment or replace cells damaged by illness, 
autoimmune disease, accidental damage or genetic disorders. The polypeptide of the 
invention may be useful for inducing the proliferation of neural cells and for the 
regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic 
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In 
addition, the expanded stem cell populations can also be genetically altered for gene 
therapy purposes and to decrease host rejection of replacement tissues after grafting or 
implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can 
also be manipulated to achieve controlled differentiation of the stem cells into more 
differentiated cell types. A broadly applicable method of obtaining pure populations of 
a specific differentiated cell type from undifferentiated stem cell populations involves 
the use of a cell-type specific promoter driving a selectable marker. The selectable 
marker allows only cells of the desired type to survive. For example, stem cells can be 
induced to differentiate into cardiomyocytes (Wobus et al., Differentiation, 48: 173- 
182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) or skeletal muscle 
cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., 
Academic Press (1997)). Alternatively, directed differentiation of stem cells can be 
accomplished by culturing the stem cells in the presence of a differentiation factor such 
as retinoic acid and an antagonist of the polypeptide of the invention which would 
inhibit the effects of endogenous stem cell factor activity and allow differentiation to 
proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the 
invention exhibits stem cell growth factor activity. Stem cells are isolated from any one 
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of various cell sources (including hematopoietic stem cells and embryonic stem cells) 
and cultured on a feeder layer, as described by Thompson et al. Proc. Natl. Acad. Sci, 
U.S.A., 92: 7844-7848 (1995), in the presence of the polypeptide of the invention 
alone or in combination with other growth factors or cytokines. The ability of the 
polypeptide of the invention to induce stem cells proliferation is determined by colony 
formation on semi-solid support e.g. as described by Bernstein et al., Blood, 77: 2316- 
2321 (1991). 

4.7.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of 
hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell 
disorders. Even marginal biological activity in support of colony forming cells or of 
factor-dependent cell lines indicates involvement in regulating hematopoiesis, e.g. in 
supporting the growth and proliferation of erythroid progenitor cells alone or in 
combination with other cytokines, thereby indicating utility, for example, in treating 
various anemias or for use in conjunction with irradiation/chemotherapy to stimulate the 
production of erythroid precursors and/or erythroid cells; in supporting the growth and 
proliferation of myeloid cells such as granulocytes and monocytes/macrophages (i.e., 
traditional CSF activity) useful, for example, in conjunction with chemotherapy to 
prevent or treat consequent myelo-suppression; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets thereby allowing 
prevention or treatment of various platelet disorders such as thrombocytopenia, and 
generally for use in place of or complimentary to platelet transfusions; and/or in 
supporting the growth and proliferation of hematopoietic stem cells which are capable 
of maturing to any and all of the above-mentioned hematopoietic cells and therefore 
find therapeutic utility in various stem cell disorders (such as those usually treated with 
transplantation, including, without limitation, aplastic anemia and paroxysmal nocturnal 
hemoglobinuria), as well as in repopulating the stem cell compartment post 
irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone 
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marrow transplantation or with peripheral progenitor cell transplantation (homologous 
or heterologous)) as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 

Suitable assays for proliferation and differentiation of various hematopoietic 
5 lines are cited above. 

Assays for embryonic stem cell differentiation (which will identify, among 
others, proteins that influence embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; 
Keller et al., Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., 
10 Blood 81:2903-2915, 1993. 

Assays for stem cell survival and differentiation (which will identify, among 
others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M. G. In Culture of 
Hematopoietic Cells. R. L Freshney, etal. eds. Vol pp. 265-268, Wiley-Liss, Inc., 
15 New York, N.Y. 1994; Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 
1992; Primitive hematopoietic colony forming cells with high proliferative potential, 
McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic Cells. R. I. Freshney, 
et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et al., 
Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 
20 Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol 
pp. 1-21, Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in 
the presence of stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of 
Hematopoietic Cells. R. I. Freshney, etal. eds. Vol pp. 163-179, Wiley-Liss, Inc., 
New York, N.Y. 1994; Long term culture initiating cell assay, Sutherland, H. J. In 
25 Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 139-162, 
Wiley-Liss, Inc., New York, N.Y. 1994. 

4.7.6 TISSUE GROWTH ACTIVITY 
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A polypeptide of the present invention also may be involved in bone, cartilage, 
tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound 
healing and tissue repair and replacement, and in healing of burns, incisions and ulcers. 
A polypeptide of the present invention which induces cartilage and/or bone 
5 growth in circumstances where bone is not normally formed, has application in the 
healing of bone fractures and cartilage damage or defects in humans and other animals. 
Compositions of a polypeptide, antibody, binding partner, or other modulator of the 
invention may have prophylactic use in closed as well as open fracture reduction and 
also in the improved fixation of artificial joints. De novo bone formation induced by an 
10 osteogenic agent contributes to the repair of congenital, trauma induced, or oncologic 
resection induced craniofacial defects, and also is useful in cosmetic plastic surgery. 
n A polypeptide of this invention may also be involved in attracting bone-forming 

cells, stimulating growth of bone-forming cells, or inducing differentiation of 
3 'J progenitors of bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone 
; ,15 degenerative disorders, or periodontal disease, such as through stimulation of bone 
* f and/or cartilage repair or by blocking inflammation or processes of tissue destruction 

(collagenase activity, osteoclast activity, etc.) mediated by inflammatory processes may 
, J also be possible using the composition of the invention. 

|; " Another category of tissue regeneration activity that may involve the polypeptide 

:M20 of the present invention is tendon/ligament formation. Induction of 

tendon/ligament-like tissue or other tissue formation in circumstances where such tissue 
is not normally formed, has application in the healing of tendon or ligament tears, 
deformities and other tendon or ligament defects in humans and other animals. Such a 
preparation employing a tendon/ligament-like tissue inducing protein may have 
25 prophylactic use in preventing damage to tendon or ligament tissue, as well as use in 
the improved fixation of tendon or ligament to bone or other tissues, and in repairing 
defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation 
induced by a composition of the present invention contributes to the repair of 
congenital, trauma induced, or other tendon or ligament defects of other origin, and is 
30 also useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. 
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The compositions of the present invention may provide environment to attract tendon- 
or ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, 
induce differentiation of progenitors of tendon- or ligament-forming cells, or induce 
growth of tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue 
5 repair. The compositions of the invention may also be useful in the treatment of 
tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The 
compositions may also include an appropriate matrix and/or sequestering agent as a 
carrier as is well known in the art. 

The compositions of the present invention may also be useful for proliferation of 
10 neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of 

central and peripheral nervous system diseases and neuropathies, as well as mechanical 
ri and traumatic disorders, which involve degeneration, death or trauma to neural cells or 
)J nerve tissue. More specifically, a composition may be used in the treatment of diseases 

it ; I 

f y of the peripheral nervous system, such as peripheral nerve injuries, peripheral 

HIS neuropathy and localized neuropathies, and central nervous system diseases, such as 

Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, 

b and Shy-Drager syndrome. Further conditions which may be treated in accordance with 

%j the present invention include mechanical and traumatic disorders, such as spinal cord 

f Z, disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral 

! ;?20 neuropathies resulting from chemotherapy or other medical therapies may also be 

S 3 

treatable using a composition of the invention. 

Compositions of the invention may also be useful to promote better or faster 
closure of non-healing wounds, including without limitation pressure ulcers, ulcers 
associated with vascular insufficiency, surgical and traumatic wounds, and the like. 
25 Compositions of the present invention may also be involved in the generation or 

regeneration of other tissues, such as organs (including, for example, pancreas, liver, 
intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular 
(including vascular endothelium) tissue, or for promoting the growth of cells 
comprising such tissues. Part of the desired effects may be by inhibition or modulation 
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of fibrotic scarring may allow normal tissue to regenerate. A polypeptide of the present 
invention may also exhibit angiogenic activity. 

A composition of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various 
tissues, and conditions resulting from systemic cytokine damage. 

A composition of the present invention may also be useful for promoting or 
inhibiting differentiation of tissues described above from precursor tissues or cells; or 
for inhibiting the growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 

Assays for tissue generation activity include, without limitation, those described 
in: International Patent Publication No. WO95/16035 (bone, cartilage, tendon); 
International Patent Publication No. WO95/05846 (nerve, neuronal); International 
Patent Publication No. WO91/07491 (skin, endothelium). 

Assays for wound healing activity include, without limitation, those described 
in: Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., 
eds.), Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and 
Mertz, J. Invest. Dermatol 71:382-84 (1978). 

4.7.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays 
are described herein. A polynucleotide of the invention can encode a polypeptide 
exhibiting such activities. A protein may be useful in the treatment of various immune 
deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g., 
in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well 
as effecting the cytolytic activity of NK cells and other cell populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or 
fungal infections, or may result from autoimmune disorders. More specifically, 
infectious diseases causes by viral, bacterial, fungal or other infection may be treatable 
using a protein of the present invention, including infections by HIV, hepatitis viruses, 
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herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal 
infections such as candidiasis. Of course, in this regard, proteins of the present 
invention may also be useful where a boost to the immune system generally may be 
desirable, i.e. , in the treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 
Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 
myasthenia gravis, graft- versus-host disease and autoimmune inflammatory eye disease. 
Such a protein (or antagonists thereof, including antibodies) of the present invention 
may also to be useful in the treatment of allergic reactions and conditions (e.g., 
anaphylaxis, serum sickness, drug reactions, food allergies, insect venom allergies, 
mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, angioedema, 
eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, Stevens- 
Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
(particularly allergic asthma) or other respiratory problems. Other conditions, in which 
immune suppression is desired (including, for example, organ transplantation), may 
also be treatable using a protein (or antagonists thereof) of the present invention. The 
therapeutic effects of the polypeptides or antagonists thereof on allergic reactions can be 
evaluated by in vivo animals models such as the cumulative contact enhancement test 
(Lastbom et al., Toxicology 125: 59-66, 1998), skin prick test (Hofftnann et al., 
Allergy 54: 446-54, 1999), guinea pig skin sensitization test (Vohr et al., Arch. 
Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., J. Toxicol. 
Environ. Health 53: 563-79). 

Using the proteins of the invention it may also be possible to modulate immune 
responses, in a number of ways. Down regulation may be in the form of inhibiting or 
blocking an immune response already in progress or may involve preventing the 
induction of an immune response. The functions of activated T cells may be inhibited 
by suppressing T cell responses or by inducing specific tolerance in T cells, or both. 
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Immunosuppression of T cell responses is generally an active, non-antigen-specific, 
process which requires continuous exposure of the T cells to the suppressive agent. 
Tolerance, which involves inducing non-responsiveness or anergy in T cells, is 
distinguishable from immunosuppression in that it is generally antigen-specific and 
persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can 
be demonstrated by the lack of a T cell response upon reexposure to specific antigen in 
the absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing 
high level lymphokine synthesis by activated T cells, will be useful in situations of 
tissue, skin and organ transplantation and in graft-versus-host disease (GVHD). For 
example, blockage of T cell function should result in reduced tissue destruction in tissue 
transplantation. Typically, in tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an immune reaction that 
destroys the transplant. The administration of a therapeutic composition of the invention 
may prevent cytokine synthesis by immune cells, such as T cells, and thus acts as an 
immunosuppressant. Moreover, a lack of costimulation may also be sufficient to 
anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term 
tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of 
repeated administration of these blocking reagents. To achieve sufficient 
immunosuppression or tolerance in a subject, it may also be necessary to block the 
function of a combination of B lymphocyte antigens. 

The efficacy of particular therapeutic compositions in preventing organ 
transplant rejection or GVHD can be assessed using animal models that are predictive 
of efficacy in humans. Examples of appropriate systems which can be used include 
allogeneic cardiac grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both 
of which have been used to examine the immunosuppressive effects of CTLA4Ig fusion 
proteins in vivo as described in Lenschow et al., Science 257:789-792 (1992) and 
Turka et al., Proc. Natl. Acad. Sci USA, 89:11102-11105 (1992). In addition, murine 
models of GVHD (see Paul ed., Fundamental Immunology, Raven Press, New York, 
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1989, pp. 846-847) can be used to determine the effect of therapeutic compositions of 
the invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate 
activation of T cells that are reactive against self tissue and which promote the 
production of cytokines and autoantibodies involved in the pathology of the diseases. 
Preventing the activation of autoreactive T cells may reduce or eliminate disease 
symptoms. Administration of reagents which block stimulation of T cells can be used to 
inhibit T cell activation and prevent production of autoantibodies or T cell-derived 
cytokines which may be involved in the disease process. Additionally, blocking 
reagents may induce antigen-specific tolerance of autoreactive T cells which could lead 
to long-term relief from the disease. The efficacy of blocking reagents in preventing or 
alleviating autoimmune disorders can be determined using a number of 
well-characterized animal models of human autoimmune diseases. Examples include 
murine experimental autoimmune encephalitis, systemic lupus erythmatosis in 
MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune collagen arthritis, diabetes 
mellitus in NOD mice and BB rats, and murine experimental myasthenia gravis (see 
Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as 
a means of up regulating immune responses, may also be useful in therapy. 
Upregulation of immune responses may be in the form of enhancing an existing 
immune response or eliciting an initial immune response. For example, enhancing an 
immune response may be useful in cases of viral infection, including systemic viral 
diseases such as influenza, the common cold, and encephalitis. 

Alternatively, anti- viral immune responses may be enhanced in an infected 
patient by removing T cells from the patient, costimulating the T cells in vitro with 
viral antigen-pulsed APCs either expressing a peptide of the present invention or 
together with a stimulatory form of a soluble peptide of the present invention and 
reintroducing the in vitro activated T cells into the patient. Another method of 
enhancing anti-viral immune responses would be to isolate infected cells from a patient, 
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transfect them with a nucleic acid encoding a protein of the present invention as 
described herein such that the cells express all or a portion of the protein on their 
surface, and reintroduce the transfected cells into the patient. The infected cells would 
now be capable of delivering a costimulatory signal to, and thereby activate, T cells in 



A polypeptide of the present invention may provide the necessary stimulation 
signal to T cells to induce a T cell mediated immune response against the transfected 
tumor cells. In addition, tumor cells which lack MHC class I or MHC class II 
molecules, or which fail to reexpress sufficient mounts of MHC class I or MHC class II 

10 molecules, can be transfected with nucleic acid encoding all or a portion of (e.g., a 
cytoplasmic-domain truncated portion) of an MHC class I alpha chain protein and P2 
microglobulin protein or an MHC class II alpha chain protein and an MHC class II beta 
chain protein to thereby express MHC class I or MHC class II proteins on the cell 
surface. Expression of the appropriate class I or class II MHC in conjunction with a 

15 peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces 
a T cell mediated immune response against the transfected tumor cell. Optionally, a 
gene encoding an antisense construct which blocks expression of an MHC class II 
associated protein, such as the invariant chain, can also be cotransfected with a DNA 
encoding a peptide having the activity of a B lymphocyte antigen to promote 

20 presentation of tumor associated antigens and induce tumor specific immunity. Thus, 
the induction of a T cell mediated immune response in a human subject may be 
sufficient to overcome tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

25 Suitable assays for thymocyte or splenocyte cytotoxicity include, without 

limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, 
A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene 
Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); 

30 Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et al., J. 



5 



vivo. 
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Immunol. 128:1968-1974, 1982; Handaetal., J. Immunol. 135:1564-1572, 1985; 
Takai et al., I. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 
1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et al., Cellular 
Immunology 133:327-341, 1991; Brown etal., J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching 
(which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those described 
in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In 
vitro antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, 
Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
proteins that generate predominantly Thl and CTL responses) include, without 
limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, 
A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene 
Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et 
al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; 
Bertagnolli etal., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, 
those described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal 
of Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of 
Immunology 154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 
182:255-260, 1995; Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., 
Science 264:961-965, 1994; Macatonia et al., Journal of Experimental Medicine 
169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 94:797-807, 
1994; and Inaba et al., Journal of Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, 
proteins that prevent apoptosis after superantigen induction and proteins that regulate 
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lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 
et al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; 
Gorczyca et al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 
1991; Zacharchuk, Journal of Immunology 145:4037-4045, 1990; Zamai et al., 
Cytometry 14:891-897, 1993; Gorczyca et al., International Journal of Oncology 
1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and 
development include, without limitation, those described in: Antica et al., Blood 
84:111-117, 1994; Fine et al., Cellular Immunology 155:111-122, 1994; Galy et al., 
Blood 85:2770-2778, 1995; Toki et al., Proc. Nat. Acad Sci. USA 88:7548-7551, 
1991. 

4.7.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or 
inhibin-related activities. A polynucleotide of the invention may encode a polypeptide 
exhibiting such characteristics. Inhibins are characterized by their ability to inhibit the 
release of follicle stimulating hormone (FSH), while activins and are characterized by 
their ability to stimulate the release of follicle stimulating hormone (FSH). Thus, a 
polypeptide of the present invention, alone or in heterodimers with a member of the 
inhibin family, may be useful as a contraceptive based on the ability of inhibins to 
decrease fertility in female mammals and decrease spermatogenesis in male mammals. 
Administration of sufficient amounts of other inhibins can induce infertility in these 
mammals. Alternatively, the polypeptide of the invention, as a homodimer or as a 
heterodimer with other protein subunits of the inhibin group, may be useful as a fertility 
inducing therapeutic, based upon the ability of activin molecules in stimulating FSH 
release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. 
A polypeptide of the invention may also be useful for advancement of the onset of 
fertility in sexually immature mammals, so as to increase the lifetime reproductive 
performance of domestic animals such as, but not limited to, cows, sheep and pigs. 
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The activity of a polypeptide of the invention may, among other means, be 
measured by the following methods. 

Assays for activin/inhibin activity include, without limitation, those described 
in: Vale et al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 
1986; Vale et al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; 
Forage etal., Proc. Natl. Acad. Sci. USA 83:3091-3095, 1986. 

4.7.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or 
chemokinetic activity for mammalian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial 
cells. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Chemotactic and chemokinetic receptor activation can be used to mobilize 
or attract a desired cell population to a desired site of action. Chemotactic or 
chemokinetic compositions (e.g. proteins, antibodies, binding partners, or modulators 
of the invention) provide particular advantages in treatment of wounds and other trauma 
to tissues, as well as in treatment of localized infections. For example, attraction of 
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in 
improved immune responses against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it 
can stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate 
directed movement of cells. Whether a particular protein has chemotactic activity for a 
population of cells can be readily determined by employing such protein or peptide in 
any known assay for cell chemotaxis. 

Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or 
prevent chemotaxis) consist of assays that measure the ability of a protein to induce the 
migration of cells across a membrane as well as the ability of a protein to induce the 
adhesion of one cell population to another cell population. Suitable assays for 
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movement and adhesion include, without limitation, those described in: Current 
Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D. H. Marguiles, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience 
(Chapter 6. 12, Measurement of alpha and beta Chemokines 6. 12. 1-6. 12.28; Taub et al. 
J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 1995; Muller et 
al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 1994; 
Johnston etal. J. of Immunol. 153:1762-1768, 1994. 

4.7.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or 
thrombolysis or thrombosis. A polynucleotide of the invention can encode a 
polypeptide exhibiting such attributes. Compositions may be useful in treatment of 
various coagulation disorders (including hereditary disorders, such as hemophilias) or 
to enhance coagulation and other hemostatic events in treating wounds resulting from 
trauma, surgery or other causes. A composition of the invention may also be useful for 
dissolving or inhibiting formation of thromboses and for treatment and prevention of 
conditions resulting therefrom (such as, for example, infarction of cardiac and central 
nervous system vessels (e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 
Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., 
Thrombosis Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); 
Schaub, Prostaglandins 35:467-474, 1988. 

4.7.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, 
proliferation or metastasis. Detection of the presence or amount of polynucleotides or 
polypeptides of the invention may be useful for the diagnosis and/or prognosis of one or 
more types of cancer. For example, the presence or increased expression of a 
polynucleotide/polypeptide of the invention may indicate a hereditary risk of cancer, a 
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precancerous condition, or an ongoing malignancy. Conversely, a defect in the gene or 
absence of the polypeptide may be associated with a cancer condition. Identification of 
single nucleotide polymorphisms associated with cancer or a predisposition to cancer 
may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell 
proliferation, inhibiting angiogenesis (growth of new blood vessels that is necessary to 
support tumor growth) and/or prohibiting metastasis by reducing tumor cell motility or 
invasiveness. Therapeutic compositions of the invention may be effective in adult and 
pediatric oncology including in solid phase tumors/malignancies, locally advanced 
tumors, human soft tissue sarcomas, metastatic cancer, including lymphatic metastases, 
blood cell malignancies including multiple myeloma, acute and chronic leukemias, and 
lymphomas, head and neck cancers including mouth cancer, larynx cancer and thyroid 
cancer, lung cancers including small cell carcinoma and non-small cell cancers, breast 
cancers including small cell carcinoma and ductal carcinoma, gastrointestinal cancers 
including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and 
polyps associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic 
cancers including bladder cancer and prostate cancer, malignancies of the female 
genital tract including ovarian carcinoma, uterine (including endometrial) cancers, and 
solid tumor in the ovarian follicle, kidney cancers including renal cell carcinoma, brain 
cancers including intrinsic brain tumors, neuroblastoma, astrocytic brain tumors, 
gliomas, metastatic tumor cell invasion in the central nervous system, bone cancers 
including osteomas, skin cancers including malignant melanoma, tumor progression of 
human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 
hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention 
(including inhibitors and stimulators of the biological activity of the polypeptide of the 
invention) may be administered to treat cancer. Therapeutic compositions can be 
administered in therapeutically effective dosages alone or in combination with adjuvant 
cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser 
therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of 
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tumor growth, inhibiting metastasis, or otherwise improving overall clinical condition, 
without necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as 
a portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the 
polypeptide or modulator of the invention with one or more anti-cancer drugs in 
addition to a pharmaceutical^ acceptable carrier for delivery. The use of anti-cancer 
cocktails as a cancer treatment is routine. Anti-cancer drugs that are well known in the 
art and can be used as a treatment in combination with the polypeptide or modulator of 
the invention include: Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, 
Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis-DDP), 
Cyclophosphamide, Cytarabine HC1 (Cytosine arabinoside), Dacarbazine, 
Dactinomycin, Daunorubicin HC1, Doxorubicin HC1, Estramustine phosphate sodium, 
Etoposide (V16-213), Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea 
(hydroxy carbamide), Ifosfamide, Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide 
acetate (LHRH-releasing factor analog), Lomustine, Mechlorethamine HC1 (nitrogen 
mustard), Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX), Mitomycin, 
Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, Streptozocin, 
Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Inter leukin-2, Mitoguazone, 
Pentostatin, Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for 
prophylactic treatment of cancer. There are hereditary conditions and/or environmental 
situations (e.g. exposure to carcinogens) known in the art that predispose an individual 
to developing cancers. Under these circumstances, it may be beneficial to treat these 
individuals with therapeutically effective doses of the polypeptide of the invention to 
reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide 
of the invention as a potential cancer treatment. These in vitro models include 
proliferation assays of cultured tumor cells, growth of cultured tumor cells in soft agar 
(see Freshney, (1987) Culture of Animal Cells: A Manual of Basic Technique, Wily- 
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Liss, New York, NY Ch 18 and Ch 21), tumor systems in nude mice as described in 
Giovanella et al., J. Nad. Can. Inst., 52: 921-30 (1974), mobility and invasive potential 
of tumor cells in Boy den Chamber assays as described in Pilkington et al. , Anticancer 
Res., 17: 4107-9 (1997), and angiogenesis assays such as induction of vascularization 
5 of the chick chorioallantoic membrane or induction of vascular endothelial cell 

migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1189-97 (1999) and Li 
et al., Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines 
are available, e.g. from American Type Tissue Culture Collection catalogs. 



10 4.7.12 RECEPTOR/LI GAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as 
receptor, receptor ligand or inhibitor or agonist of receptor/ligand interactions. A 
polynucleotide of the invention can encode a polypeptide exhibiting such characteristics. 
Examples of such receptors and ligands include, without limitation, cytokine receptors 

15 and their ligands, receptor kinases and their ligands, receptor phosphatases and their 
ligands, receptors involved in cell-cell interactions and their ligands (including without 
limitation, cellular adhesion molecules (such as selectins, integrins and their ligands) 
and receptor/ligand pairs involved in antigen presentation, antigen recognition and 
development of cellular and humoral immune responses. Receptors and ligands are 

20 also useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without 
limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of 
receptor/ligand interactions. 

The activity of a polypeptide of the invention may, among other means, be 

25 measured by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. 
Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing 
Associates and Wiley- Interscience (Chapter 7.28, Measurement of Cellular Adhesion 

30 under static conditions 7.28.1- 7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 
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84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et 
al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 
175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor 
5 for a ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may 
be identified through binding assays, affinity chromatography, dihybrid screening 
assays, BIAcore assays, gel overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial 
agonists or a partial antagonist require the use of other proteins as competing ligands. 

10 The polypeptides of the present invention or ligand(s) thereof may be labeled by being 
coupled to radioisotopes, color imetric molecules or a toxin molecules by conventional 
methods. ("Guide to Protein Purification" Murray P. Deutscher (ed) Methods in 
Enzymology Vol. 182 (1990) Academic Press, Inc. San Diego). Examples of 
radioisotopes include, but are not limited to, tritium and carbon- 14 . Examples of 

15 colorimetric molecules include, but are not limited to, fluorescent molecules such as 
fluorescamine, or rhodamine or other colorimetric molecules. Examples of toxins 
include, but are not limited, to ricin. 

4.7.13 DRUG SCREENING 

20 This invention is particularly useful for screening chemical compounds by using 

the novel polypeptides or binding fragments thereof in any of a variety of drug 
screening techniques. The polypeptides or fragments employed in such a test may 
either be free in solution, affixed to a solid support, borne on a cell surface or located 
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host 

25 cells which are stably transformed with recombinant nucleic acids expressing the 

polypeptide or a fragment thereof. Drugs are screened against such transformed cells 
in competitive binding assays. Such cells, either in viable or fixed form, can be used 
for standard binding assays. One may measure, for example, the formation of 
complexes between polypeptides of the invention or fragments and the agent being 
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tested or examine the diminution in complex formation between the novel polypeptides 
and an appropriate cell line, which are well known in the art. 

Sources for test compounds that may be screened for ability to bind to or 
modulate (i.e., increase or decrease) the activity of polypeptides of the invention 
include (1) inorganic and organic chemical libraries, (2) natural product libraries, and 
(3) combinatorial libraries comprised of either random or mimetic peptides, 
oligonucleotides or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or 
compounds that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria 
and fungi), animals, plants or other vegetation, or marine organisms, and libraries of 
mixtures for screening may be created by: (1) fermentation and extraction of broths 
from soil, plant or marine microorganisms or (2) extraction of the organisms 
themselves. Natural product libraries include polyketides, non-ribosomal peptides, and 
(non-naturally occurring) variants thereof. For a review, see Science 252:63-68 
(1998). 

Combinatorial libraries are composed of large numbers of peptides, 
oligonucleotides or organic compounds and can be readily prepared by traditional 
automated synthesis methods, PCR, cloning or proprietary synthetic methods. Of 
particular interest are peptide and oligonucleotide combinatorial libraries. Still other 
libraries of interest include peptide, protein, peptidomimetic, multiparallel synthetic 
collection, recombinatorial, and polypeptide libraries. For a review of combinatorial 
chemistry and libraries created therefrom, see Myers, Curr. Opin. BiotechnoL 8:701- 
707 (1997). For reviews and examples of peptidomimetic libraries, see Al-Obeidi et 
al., Mol Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol 
1(1): 114-19 (1997); Dorner et al., Bioorg Med Chem, 4(5):709-15 (1996) (alkylated 
dipeptides). 

Identification of modulators through use of the various libraries described herein 
permits modification of the candidate "hit" (or "lead") to optimize the capacity of the 



787CIP2B 



65 



Q 



1 



"hit" to bind a polypeptide of the invention. The molecules identified in the binding 
assay are then tested for antagonist or agonist activity in in vivo tissue culture or animal 
models that are well known in the art. In brief, the molecules are titrated into a 
plurality of cell cultures or animals and then tested for either cell/animal death or 
5 prolonged survival of the animal/cells. 

The binding molecules thus identified may be complexed with toxins, e.g., ricin 
or cholera, or with other compounds that are toxic to cells such as radioisotopes. The 
toxin-binding molecule complex is then targeted to a tumor or other cell by the 
specificity of the binding molecule for a polypeptide of the invention. Alternatively, 
10 the binding molecules may be complexed with imaging agents for targeting and imaging 
purposes. 

4,7,14 ASSAY FOR RECEPTOR ACTIVITY 

The invention also provides methods to detect specific binding of a polypeptide 

15 e.g. a ligand or a receptor. The art provides numerous assays particularly useful for 
identifying previously unknown binding partners for receptor polypeptides of the 
invention. For example, expression cloning using mammalian or bacterial cells, or 
dihybrid screening assays can be used to identify polynucleotides encoding binding 
partners. As another example, affinity chromatography with the appropriate 

20 immobilized polypeptide of the invention can be used to isolate polypeptides that 
recognize and bind polypeptides of the invention. There are a number of different 
libraries used for the identification of compounds, and in particular small molecules, 
that modulate (i.e., increase or decrease) biological activity of a polypeptide of the 
invention. Ligands for receptor polypeptides of the invention can also be identified by 

25 adding exogenous ligands, or cocktails of ligands to two cells populations that are 

genetically identical except for the expression of the receptor of the invention: one cell 
population expresses the receptor of the invention whereas the other does not. The 
response of the two cell populations to the addition of ligands(s) are then compared. 
Alternatively, an expression library can be co-expressed with the polypeptide of the 

30 invention in cells and assayed for an autocrine response to identify potential ligand(s). 
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As still another example, BIAcore assays, gel overlay assays, or other methods known 
in the art can be used to identify binding partner polypeptides, including, (1) organic 
and inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial 
libraries comprised of random peptides, oligonucleotides or organic molecules. 
5 The role of downstream intracellular signaling molecules in the signaling 

cascade of the polypeptide of the invention can be determined. For example, a chimeric 
protein in which the cytoplasmic domain of the polypeptide of the invention is fused to 
the extracellular portion of a protein, whose ligand has been identified, is produced in a 
host cell. The cell is then incubated with the ligand specific for the extracellular portion 
10 of the chimeric protein, thereby activating the chimeric receptor. Known downstream 
proteins involved in intracellular signaling can then be assayed for expected 
modifications i.e. phosphorylation. Other methods known to those in the art can also be 
.: used to identify signaling molecules involved in receptor activity. 

j 15 4.7.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory 
activity. The anti-inflammatory activity may be achieved by providing a stimulus to 
; cells involved in the inflammatory response, by inhibiting or promoting cell-cell 
- : interactions (such as, for example, cell adhesion), by inhibiting or promoting 

20 chemotaxis of cells involved in the inflammatory process, inhibiting or promoting cell 
extravasation, or by stimulating or suppressing production of other factors which more 
directly inhibit or promote an inflammatory response. Compositions with such activities 
can be used to treat inflammatory conditions including chronic or acute conditions), 
including without limitation intimation associated with infection (such as septic shock, 

25 sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion 
injury, endotoxin lethality, arthritis, complement-mediated hyperacute rejection, 
nephritis, cytokine or chemokine-induced lung injury, inflammatory bowel disease, 
Crohn's disease or resulting from over production of cytokines such as TNF or IL-1. 
Compositions of the invention may also be useful to treat anaphylaxis and 

30 hypersensitivity to an antigenic substance or material. Compositions of this invention 
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may be utilized to prevent or treat conditions such as, but not limited to, sepsis, acute 
pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid arthritis, chronic 
inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, graft 
versus host disease, inflammatory bowel disease, inflamation associated with 
5 pulmonary disease, other autoimmune disease or inflammatory disease, an 

antiproliferative agent such as for acute or chronic mylegenous leukemia or in the 
prevention of premature labor secondary to intrauterine infections. 

4.7.16 LEUKEMIAS 

10 Leukemias and related disorders may be treated or prevented by administration 

of a therapeutic that promotes or inhibits function of the polynucleotides and/or 
polypeptides of the invention. Such leukemias and related disorders include but are not 
limited to acute leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, 
myeloblasts, promyelocytic, myelomonocytic, monocytic, erythroleukemia, chronic 

15 leukemia, chronic myelocytic (granulocytic) leukemia and chronic lymphocytic 

leukemia (for a review of such disorders, see Fishman et al., 1985, Medicine, 2d Ed., 
J.B. Lippincott Co., Philadelphia). 

4.7.17 NERVOUS SYSTEM DISORDERS 

20 Nervous system disorders, involving cell types which can be tested for efficacy 

of intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an 
indication of therapeutic utility, include but are not limited to nervous system injuries, 
and diseases or disorders which result in either a disconnection of axons, a diminution 

25 or degeneration of neurons, or demyelinatiom Nervous system lesions which may be 
treated in a patient (including human and non-human mammalian patients) according to 
the invention include but are not limited to the following lesions of either the central 
(including spinal cord, brain) or peripheral nervous systems: 
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(i) traumatic lesions, including lesions caused by physical injury or 
associated with surgery, for example, lesions which sever a portion of the nervous 
system, or compression injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous 

5 system results in neuronal injury or death, including cerebral infarction or ischemia, or 
spinal cord infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed 
or injured as a result of infection, for example, by an abscess or associated with 
infection by human immunodeficiency virus, herpes zoster, or herpes simplex virus or 

10 with Lyme disease, tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is 
destroyed or injured as a result of a degenerative process including but not limited to 
degeneration associated with Parkinson's disease, Alzheimer's disease, Huntington's 
chorea, or amyotrophic lateral sclerosis; 

15 (v) lesions associated with nutritional diseases or disorders, in which a 

portion of the nervous system is destroyed or injured by a nutritional disorder or 
disorder of metabolism including but not limited to, vitamin B12 deficiency, folic acid 
deficiency, Wernicke disease, tobacco-alcohol amblyopia, Marchiafava-Bignami disease 
(primary degeneration of the corpus callosum), and alcoholic cerebellar degeneration; 

20 (vi) neurological lesions associated with systemic diseases including but not 

limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, 
carcinoma, or sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

25 (viii) demyelinated lesions in which a portion of the nervous system is 

destroyed or injured by a demyelinating disease including but not limited to multiple 
sclerosis, human immunodeficiency virus-associated myelopathy, transverse 
myelopathy or various etiologies, progressive multifocal leukoencephalopathy, and 
central pontine myelinolysis. 
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Therapeutics which are useful according to the invention for treatment of a 
nervous system disorder may be selected by testing for biological activity in promoting 
the survival or differentiation of neurons. For example, and not by way of limitation, 
therapeutics which elicit any of the following effects may be useful according to the 
5 invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in 
vivo, e.g., choline acetyltransferase or acetylcholinesterase with respect to motor 

10 neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the 
method set forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased 

15 sprouting of neurons may be detected by methods set forth in Pestronk et al. (1980, 
Exp. Neurol. 70:65-82) or Brown et al. (1981, Ann. Rev. Neurosci. 4:17-42); 
increased production of neuron-associated molecules may be measured by bioassay, 
enzymatic assay, antibody binding, Northern blot assay, etc., depending on the 
molecule to be measured; and motor neuron dysfunction may be measured by assessing 

20 the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron 
conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according 
to the invention include but are not limited to disorders such as infarction, infection, 
exposure to toxin, trauma, surgical damage, degenerative disease or malignancy that 

25 may affect motor neurons as well as other components of the nervous system, as well as 
disorders that selectively affect neurons such as amyotrophic lateral sclerosis, and 
including but not limited to progressive spinal muscular atrophy, progressive bulbar 
palsy, primary lateral sclerosis, infantile and juvenile muscular atrophy, progressive 
bulbar paralysis of childhood (Fazio-Londe syndrome), poliomyelitis and the post polio 

30 syndrome, and Hereditary Motorsensory Neuropathy (Charcot-Marie-Tooth Disease). 
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4.7.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following 
additional activities or effects: inhibiting the growth, infection or function of, or killing, 
5 infectious agents, including, without limitation, bacteria, viruses, fungi and other 

parasites; effecting (suppressing or enhancing) bodily characteristics, including, without 
limitation, height, weight, hair color, eye color, skin, fat to lean ratio or other tissue 
pigmentation, or organ or body part size or shape (such as, for example, breast 
augmentation or diminution, change in bone form or shape); effecting biorhythms or 

10 circadian cycles or rhythms; effecting the fertility of male or female subjects; effecting 
the metabolism, catabolism, anabolism, processing, utilization, storage or elimination 
of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other 
nutritional factors or component(s); effecting behavioral characteristics, including, 
without limitation, appetite, libido, stress, cognition (including cognitive disorders), 

15 depression (including depressive disorders) and violent behaviors; providing analgesic 
effects or other pain reducing effects; promoting differentiation and growth of 
embryonic stem cells in lineages other than hematopoietic lineages; hormonal or 
endocrine activity; in the case of enzymes, correcting deficiencies of the enzyme and 
treating deficiency-related diseases; treatment of hyperproliferative disorders (such as, 

20 for example, psoriasis); immunoglobulin-like activity (such as, for example, the ability 
to bind antigens or complement); and the ability to act as an antigen in a vaccine 
composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

25 4.7.19 IDENTIFICATION OF POLYMORPHISMS 

The demonstration of polymorphisms makes possible the identification of such 
polymorphisms in human subjects and the pharmacogenetic use of this information for 
diagnosis and treatment. Such polymorphisms may be associated with, e.g., differential 
predisposition or susceptibility to various disease states (such as disorders involving 
30 inflammation or immune response) or a differential response to drug administration, 
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and this genetic information can be used to tailor preventive or therapeutic treatment 
appropriately. For example, the existence of a polymorphism associated with a 
predisposition to inflammation or autoimmune disease makes possible the diagnosis of 
this condition in humans by identifying the presence of the polymorphism. 
5 Polymorphisms can be identified in a variety of ways known in the art which all 

generally involve obtaining a sample from a patient, analyzing DNA from the sample, 
optionally involving isolation or amplification of the DNA, and identifying the presence 
of the polymorphism in the DNA. For example, PCR may be used to amplify an 
appropriate fragment of genomic DNA which may then be sequenced. Alternatively, 

10 the DNA may be subjected to allele-specific oligonucleotide hybridization (in which 
appropriate oligonucleotides are hybridized to the DNA under conditions permitting 
detection of a single base mismatch) or to a single nucleotide extension assay (in which 
an oligonucleotide that hybridizes immediately adjacent to the position of the 
polymorphism is extended with one or more labeled nucleotides). In addition, 

15 traditional restriction fragment length polymorphism analysis (using restriction enzymes 
that provide differential digestion of the genomic DNA depending on the presence or 
absence of the polymorphism) may be performed. Arrays with nucleotide sequences of 
the present invention can be used to detect polymorphisms. The array can comprise 
modified nucleotide sequences of the present invention in order to detect the nucleotide 

20 sequences of the present invention. In the alternative, any one of the nucleotide 

sequences of the present invention can be placed on the array to detect changes from 
those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence 
could also be detected by detecting a corresponding change in amino acid sequence of 
25 the protein, e.g., by an antibody specific to the variant sequence. 



4,7.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against 
rheumatoid arthritis is determined in an experimental animal model system. The 
30 experimental model system is adjuvant induced arthritis in rats, and the protocol is 
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described by J. Holoshitz, et at., 1983, Science, 219:56, or by B. Waksman et al, 
1963, Int. Arch. Allergy Appl. Immunol., 23:129. Induction of the disease can be 
caused by a single injection, generally intradermally, of a suspension of killed 
Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The route of 
5 injection can vary, but rats may be injected at the base of the tail with an adjuvant 

mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose 
of about 1-5 mg/kg. The control consists of administering PBS only. 

The procedure for testing the effects of the test compound would consist of 
intradermally injecting killed Mycobacterium tuberculosis in CFA followed by 
10 immediately administering the test compound and subsequent treatment every other day 
until day 24. At 14, 15, 18, 20, 22, and 24 days after injection of Mycobacterium 
CFA, an overall arthritis score may be obtained as described by J. Holoskitz above. An 
analysis of the data would reveal that the test compound would have a dramatic affect 
on the swelling of the joints as measured by a decrease of the arthritis score. 

15 

4.8 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and 
antibodies or other binding partners or modulators including antisense polynucleotides) 
20 of the invention have numerous applications in a variety of therapeutic methods. 

Examples of therapeutic applications include, but are not limited to, those exemplified 
herein. 

4.8,1 EXAMPLE 

25 One embodiment of the invention is the administration of an effective amount of 

the polypeptides or other composition of the invention to individuals affected by a 
disease or disorder that can be modulated by regulating the peptides of the invention. 
While the mode of administration is not particularly important, parenteral 
administration is preferred. An exemplary mode of administration is to deliver an 

30 intravenous bolus. The dosage of the polypeptides or other composition of the 
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invention will normally be determined by the prescribing physician. It is to be expected 
that the dosage will vary according to the age, weight, condition and response of the 
individual patient. Typically, the amount of polypeptide administered per dose will be 
in the range of about 0.01|ag/kg to 100 mg/kg of body weight, with the preferred dose 
5 being about 0. l^g/kg to 10 mg/kg of patient body weight. For parenteral 

administration, polypeptides of the invention will be formulated in an injectable form 
combined with a pharmaceutically acceptable parenteral vehicle. Such vehicles are well 
known in the art and examples include water, saline, Ringer's solution, dextrose 
solution, and solutions consisting of small amounts of the human serum albumin. The 
10 vehicle may contain minor amounts of additives that maintain the isotonicity and 

stability of the polypeptide or other active ingredient. The preparation of such solutions 
is within the skill of the art. 



15 4.9 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

A protein or other composition of the present invention (from whatever source 
derived, including without limitation from recombinant and non-recombinant sources 
and including antibodies and other binding partners of the polypeptides of the invention) 

20 may be administered to a patient in need, by itself, or in pharmaceutical compositions 
where it is mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a 
variety of disorders. Such a composition may optionally contain (in addition to protein 
or other active ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, 
solubilizers, and other materials well known in the art. The term "pharmaceutically 

25 acceptable" means a non-toxic material that does not interfere with the effectiveness of 
the biological activity of the active ingredient(s). The characteristics of the carrier will 
depend on the route of administration. The pharmaceutical composition of the 
invention may also contain cytokines, lymphokines, or other hematopoietic factors such 
as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, 

30 IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IFN, TNF0, TNF1, TNF2, G-CSF, 
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Meg-CSF, thrombopoietin, stem cell factor, and erythropoietin. In further 
compositions, proteins of the invention may be combined with other agents beneficial to 
the treatment of the disease or disorder in question. These agents include various 
growth factors such as epidermal growth factor (EGF), platelet-derived growth factor 
5 (PDGF), transforming growth factors (TGF-a and TGF-(3), insulin-like growth factor 
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which either 
enhance the activity of the protein or other active ingredient or complement its activity 
or use in treatment. Such additional factors and/or agents may be included in the 

10 pharmaceutical composition to produce a synergistic effect with protein or other active 
ingredient of the invention, or to minimize side effects. Conversely, protein or other 
active ingredient of the present invention may be included in formulations of the 
particular clotting factor, cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti- inflammatory agent to minimize side 

15 effects of the clotting factor, cytokine, lymphokine, other hematopoietic factor, 

thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as IL-IRa, IL- 
1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein of 
the present invention may be active in multimers (e.g. , heterodimers or homodimers) or 
complexes with itself or other proteins. As a result, pharmaceutical compositions of the 

20 invention may comprise a protein of the invention in such multimeric or complexed 
form. 

As an alternative to being included in a pharmaceutical composition of the 
invention including a first protein, a second protein or a therapeutic agent may be 
concurrently administered with the first protein (e.g., at the same time, or at differing 

25 times provided that therapeutic concentrations of the combination of agents is achieved 
at the treatment site). Techniques for formulation and administration of the compounds 
of the instant application may be found in "Remington's Pharmaceutical Sciences," 
Mack Publishing Co., Easton, PA, latest edition. A therapeutically effective dose 
further refers to that amount of the compound sufficient to result in amelioration of 

30 symptoms, e.g. , treatment, healing, prevention or amelioration of the relevant medical 
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condition, or an increase in rate of treatment, healing, prevention or amelioration of 
such conditions. When applied to an individual active ingredient, administered alone, a 
therapeutically effective dose refers to that ingredient alone. When applied to a 
combination, a therapeutically effective dose refers to combined amounts of the active 
5 ingredients that result in the therapeutic effect, whether administered in combination, 
serially or simultaneously. 

In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein or other active ingredient of the present 
invention is administered to a mammal having a condition to be treated. Protein or 

10 other active ingredient of the present invention may be administered in accordance with 
the method of the invention either alone or in combination with other therapies such as 
treatments employing cytokines, lymphokines or other hematopoietic factors. When 
co- administered with one or more cytokines, lymphokines or other hematopoietic 
factors, protein or other active ingredient of the present invention may be administered 

15 either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic 

factor (s), thrombolytic or anti-thrombotic factors, or sequentially. If administered 
sequentially, the attending physician will decide on the appropriate sequence of 
administering protein or other active ingredient of the present invention in combination 
with cytokine(s), lymphokine(s), other hematopoietic factor (s), thrombolytic or 

20 anti-thrombotic factors. 



4.9.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, or intestinal administration; parenteral delivery, including intramuscular, 
25 subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, 
intravenous, intraperitoneal, intranasal, or intraocular injections. Administration of 
protein or other active ingredient of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
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cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. 
Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic 
manner, for example, via injection of the compound directly into a arthritic joints or in 
5 fibrotic tissue, often in a depot or sustained release formulation. In order to prevent the 
scarring process frequently occurring as complication of glaucoma surgery, the 
compounds may be administered topically, for example, as eye drops. Furthermore, 
one may administer the drug in a targeted drug delivery system, for example, in a 
liposome coated with a specific antibody, targeting, for example, arthritic or fibrotic 
10 tissue. The liposomes will be targeted to and taken up selectively by the afflicted 
tissue. 

The polypeptides of the invention are administered by any route that delivers an 
effective dosage to the desired site of action. The determination of a suitable route of 
administration and an effective dosage for a particular indication is within the level of 
15 skill in the art. Preferably for wound treatment, one administers the therapeutic 
compound directly to the site. Suitable dosage ranges for the polypeptides of the 
invention can be extrapolated from these dosages or from similar studies in appropriate 
animal models. Dosages can then be adjusted as necessary by the clinician to provide 
maximal therapeutic benefit. 

20 

4.9.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention 
thus may be formulated in a conventional manner using one or more physiologically 
acceptable carriers comprising excipients and auxiliaries which facilitate processing of 

25 the active compounds into preparations which can be used pharmaceutically. These 
pharmaceutical compositions may be manufactured in a manner that is itself known, 
e.g., by means of conventional mixing, dissolving, granulating, dragee-making, 
levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. Proper 
formulation is dependent upon the route of administration chosen. When a 

30 therapeutically effective amount of protein or other active ingredient of the present 
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invention is administered orally, protein or other active ingredient of the present 
invention will be in the form of a tablet, capsule, powder, solution or elixir. When 
administered in tablet form, the pharmaceutical composition of the invention may 
additionally contain a solid carrier such as a gelatin or an adjuvant. The tablet, capsule, 
5 and powder contain from about 5 to 95 % protein or other active ingredient of the 
present invention, and preferably from about 25 to 90% protein or other active 
ingredient of the present invention. When administered in liquid form, a liquid carrier 
such as water, petroleum, oils of animal or plant origin such as peanut oil, mineral oil, 
soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the 

10 pharmaceutical composition may further contain physiological saline solution, dextrose 
or other saccharide solution, or glycols such as ethylene glycol, propylene glycol or 
polyethylene glycol. When administered in liquid form, the pharmaceutical 
composition contains from about 0.5 to 90% by weight of protein or other active 
ingredient of the present invention, and preferably from about 1 to 50% protein or 

15 other active ingredient of the present invention. 

When a therapeutically effective amount of protein or other active ingredient of 
the present invention is administered by intravenous, cutaneous or subcutaneous 
injection, protein or other active ingredient of the present invention will be in the form 
of a pyrogen-free, parenterally acceptable aqueous solution. The preparation of such 

20 parenterally acceptable protein or other active ingredient solutions, having due regard 
to pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 
pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection 
should contain, in addition to protein or other active ingredient of the present invention, 
an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, Dextrose 

25 Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or 
other vehicle as known in the art. The pharmaceutical composition of the present 
invention may also contain stabilizers, preservatives, buffers, antioxidants, or other 
additives known to those of skill in the art. For injection, the agents of the invention 
may be formulated in aqueous solutions, preferably in physiologically compatible 

30 buffers such as Hanks' s solution, Ringer's solution, or physiological saline buffer. For 
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transmucosal administration, penetrants appropriate to the barrier to be permeated are 
used in the formulation. Such penetrants are generally known in the art. 

For oral administration, the compounds can be formulated readily by combining 
the active compounds with pharmaceutical^ acceptable carriers well known in the art. 
5 Such carriers enable the compounds of the invention to be formulated as tablets, pills, 
dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral 
ingestion by a patient to be treated. Pharmaceutical preparations for oral use can be 
obtained from a solid excipient, optionally grinding a resulting mixture, and processing 
the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or 

10 dragee cores. Suitable excipients are, in particular, fillers such as sugars, including 
lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, 
maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl 
cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or 
polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as 

15 the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as 
sodium alginate. Dragee cores are provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, 
lacquer solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or 

20 pigments may be added to the tablets or dragee coatings for identification or to 
characterize different combinations of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules 
made of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such 
as glycerol or sorbitol. The push-fit capsules can contain the active ingredients in 

25 admixture with filler such as lactose, binders such as starches, and/or lubricants such as 
talc or magnesium stearate and, optionally, stabilizers. In soft capsules, the active 
compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid 
paraffin, or liquid polyethylene glycols. In addition, stabilizers may be added. All 
formulations for oral administration should be in dosages suitable for such 
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administration. For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the 
present invention are conveniently delivered in the form of an aerosol spray 
5 presentation from pressurized packs or a nebuliser, with the use of a suitable 
propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a 
pressurized aerosol the dosage unit may be determined by providing a valve to deliver a 
metered amount. Capsules and cartridges of, e.g. , gelatin for use in an inhaler or 

10 insufflator may be formulated containing a powder mix of the compound and a suitable 
powder base such as lactose or starch. The compounds may be formulated for 
parenteral administration by injection, e.g., by bolus injection or continuous infusion. 
Formulations for injection may be presented in unit dosage form, e.g. , in ampules or in 
multi-dose containers, with an added preservative. The compositions may take such 

15 forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may 
contain formulatory agents such as suspending, stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous 
solutions of the active compounds in water-soluble form. Additionally, suspensions of 
the active compounds may be prepared as appropriate oily injection suspensions, 

20 Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Aqueous injection 
suspensions may contain substances which increase the viscosity of the suspension, such 
as sodium carboxymethyl cellulose, sorbitol, or dextran. Optionally, the suspension 
may also contain suitable stabilizers or agents which increase the solubility of the 

25 compounds to allow for the preparation of highly concentrated solutions. Alternatively, 
the active ingredient may be in powder form for constitution with a suitable vehicle, 
e.g. , sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as 
suppositories or retention enemas, e.g., containing conventional suppository bases such 

30 as cocoa butter or other glycerides. In addition to the formulations described 
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previously, the compounds may also be formulated as a depot preparation. Such long 
acting formulations may be administered by implantation (for example subcutaneously 
or intramuscularly) or by intramuscular injection. Thus, for example, the compounds 
may be formulated with suitable polymeric or hydrophobic materials (for example as an 

5 emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble 
derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a 
co-solvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible 
organic polymer, and an aqueous phase. The co-solvent system may be the VPD 

10 co-solvent system. VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the 

nonpolar surfactant polysorbate 80, and 65% w/v polyethylene glycol 300, made up to 
volume in absolute ethanol. The VPD co-solvent system (VPD:5W) consists of VPD 
diluted 1:1 with a 5% dextrose in water solution. This co-solvent system dissolves 
hydrophobic compounds well, and itself produces low toxicity upon systemic 

15 administration. Naturally, the proportions of a co-solvent system may be varied 

considerably without destroying its solubility and toxicity characteristics. Furthermore, 
the identity of the co-solvent components may be varied: for example, other 
low-toxicity nonpolar surfactants may be used instead of polysorbate 80; the fraction 
size of polyethylene glycol may be varied; other biocompatible polymers may replace 

20 polyethylene glycol, e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides 
may substitute for dextrose. Alternatively, other delivery systems for hydrophobic 
pharmaceutical compounds may be employed. Liposomes and emulsions are well 
known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic 
solvents such as dimethylsulfoxide also may be employed, although usually at the cost 

25 of greater toxicity. Additionally, the compounds may be delivered using a 

sustained-release system, such as semipermeable matrices of solid hydrophobic 
polymers containing the therapeutic agent. Various types of sustained-release materials 
have been established and are well known by those skilled in the art. Sustained-release 
capsules may, depending on their chemical nature, release the compounds for a few 

30 weeks up to over 100 days. Depending on the chemical nature and the biological 
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stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase 
carriers or excipients. Examples of such carriers or excipients include but are not 

5 limited to calcium carbonate, calcium phosphate, various sugars, starches, cellulose 
derivatives, gelatin, and polymers such as polyethylene glycols. Many of the active 
ingredients of the invention may be provided as salts with pharmaceutically compatible 
counter ions. Such pharmaceutically acceptable base addition salts are those salts which 
retain the biological effectiveness and properties of the free acids and which are 

10 obtained by reaction with inorganic or organic bases such as sodium hydroxide, 

magnesium hydroxide, ammonia, trialkylamine, dialkylamine, monoalkylamine, dibasic 
amino acids, sodium acetate, potassium benzoate, triethanol amine and the like. 

The pharmaceutical composition of the invention may be in the form of a 
complex of the protein(s) or other active ingredient(s) of present invention along with 

15 protein or peptide antigens. The protein and/or peptide antigen will deliver a 

stimulatory signal to both B and T lymphocytes. B lymphocytes will respond to antigen 
through their surface immunoglobulin receptor. T lymphocytes will respond to antigen 
through the T cell receptor (TCR) following presentation of the antigen by MHC 
proteins. MHC and structurally related proteins including those encoded by class I and 

20 class II MHC genes on host cells will serve to present the peptide antigen(s) to T 

lymphocytes. The antigen components could also be supplied as purified MHC-peptide 
complexes alone or with co-stimulatory molecules that can directly signal T cells. 
Alternatively antibodies able to bind surface immunoglobulin and other molecules on B 
cells as well as antibodies able to bind the TCR and other molecules on T cells can be 

25 combined with the pharmaceutical composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a 
liposome in which protein of the present invention is combined, in addition to other 
pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 

30 in aqueous solution. Suitable lipids for liposomal formulation include, without 
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limitation, monoglycerides, diglycerides, sulfatides, lysolecithins, phospholipids, 
saponin, bile acids, and the like. Preparation of such liposomal formulations is within 
the level of skill in the art, as disclosed, for example, in U.S. Patent Nos. 4,235,871; 
4,501,728; 4,837,028; and 4,737,323, all of which are incorporated herein by 
reference. 

The amount of protein or other active ingredient of the present invention in the 
pharmaceutical composition of the present invention will depend upon the nature and 
severity of the condition being treated, and on the nature of prior treatments which the 
patient has undergone. Ultimately, the attending physician will decide the amount of 
protein or other active ingredient of the present invention with which to treat each 
individual patient. Initially, the attending physician will administer low doses of 
protein or other active ingredient of the present invention and observe the patient's 
response. Larger doses of protein or other active ingredient of the present invention 
may be administered until the optimal therapeutic effect is obtained for the patient, and 
at that point the dosage is not increased further. It is contemplated that the various 
pharmaceutical compositions used to practice the method of the present invention 
should contain about 0.01 jug to about 100 mg (preferably about 0.1 fig to about 10 mg, 
more preferably about 0.1 jug to about 1 mg) of protein or other active ingredient of the 
present invention per kg body weight. For compositions of the present invention which 
are useful for bone, cartilage, tendon or ligament regeneration, the therapeutic method 
includes administering the composition topically, systematically, or locally as an 
implant or device. When administered, the therapeutic composition for use in this 
invention is, of course, in a pyrogen-free, physiologically acceptable form. Further, 
the composition may desirably be encapsulated or injected in a viscous form for 
delivery to the site of bone, cartilage or tissue damage. Topical administration may be 
suitable for wound healing and tissue repair. Therapeutically useful agents other than a 
protein or other active ingredient of the invention which may also optionally be 
included in the composition as described above, may alternatively or additionally, be 
administered simultaneously or sequentially with the composition in the methods of the 
invention. Preferably for bone and/or cartilage formation, the composition would 
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include a matrix capable of delivering the protein-containing or other active 
ingredient-containing composition to the site of bone and/or cartilage damage, 
providing a structure for the developing bone and cartilage and optimally capable of 
being resorbed into the body. Such matrices may be formed of materials presently in 
5 use for other implanted medical applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential 
matrices for the compositions may be biodegradable and chemically defined calcium 
10 sulfate, tricalcium phosphate, hydroxy apatite, polylactic acid, polyglycolic acid and 
poly anhydrides. Other potential materials are biodegradable and biologically 
well-defined, such as bone or dermal collagen. Further matrices are comprised of pure 
;l; proteins or extracellular matrix components. Other potential matrices are 

nonbiodegradable and chemically defined, such as sintered hydroxy apatite, bioglass, 
; L j 15 aluminates, or other ceramics. Matrices may be comprised of combinations of any of 
J : the above mentioned types of material, such as polylactic acid and hydroxyapatite or 
collagen and tricalcium phosphate. The bioceramics may be altered in composition, 
4 such as in calcium-aluminate-phosphate and processing to alter pore size, particle size, 
:' v particle shape, and biodegradability. Presently preferred is a 50:50 (mole weight) 
? : : 20 copolymer of lactic acid and gly colic acid in the form of porous particles having 

diameters ranging from 150 to 800 microns. In some applications, it will be useful to 
utilize a sequestering agent, such as carboxymethyl cellulose or autologous blood clot, 
to prevent the protein compositions from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
25 alkylcelluloses (including hydroxy alkylcelluloses), including methylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, 

hydroxypropyl-methylcellulose, and carboxymethylcellulose, the most preferred being 
cationic salts of carboxymethylcellulose (CMC). Other preferred sequestering agents 
include hyaluronic acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, 
30 carboxy vinyl polymer and polyvinyl alcohol). The amount of sequestering agent 
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useful herein is 0.5-20 wt %, preferably 1-10 wt % based on total formulation weight, 
which represents the amount necessary to prevent desorption of the protein from the 
polymer matrix and to provide appropriate handling of the composition, yet not so 
much that the progenitor cells are prevented from infiltrating the matrix, thereby 
5 providing the protein the opportunity to assist the osteogenic activity of the progenitor 
cells. In further compositions, proteins or other active ingredients of the invention may 
be combined with other agents beneficial to the treatment of the bone and/or cartilage 
defect, wound, or tissue in question. These agents include various growth factors such 
as epidermal growth factor (EGF), platelet derived growth factor (PDGF), transforming 

10 growth factors (TGF-a and TGF-P), and insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary 
applications. Particularly domestic animals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins or other active ingredients 
of the present invention. The dosage regimen of a protein-containing pharmaceutical 

15 composition to be used in tissue regeneration will be determined by the attending 
physician considering various factors which modify the action of the proteins, e.g., 
amount of tissue weight desired to be formed, the site of damage, the condition of the 
damaged tissue, the size of a wound, type of damaged tissue (e.g. , bone), the patient's 
age, sex, and diet, the severity of any infection, time of administration and other 

20 clinical factors. The dosage may vary with the type of matrix used in the reconstitution 
and with inclusion of other proteins in the pharmaceutical composition. For example, 
the addition of other known growth factors, such as IGF I (insulin like growth factor I), 
to the final composition, may also effect the dosage. Progress can be monitored by 
periodic assessment of tissue/bone growth and/or repair, for example, X-rays, 

25 histomorphometric determinations and tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, 

30 without limitation, in the form of viral vectors or naked DNA). Cells may also be 
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cultured ex vivo in the presence of proteins of the present invention in order to 
proliferate or to produce a desired effect on or activity in such cells. Treated cells can 
then be introduced in vivo for therapeutic purposes. 

4.9.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to 
achieve its intended purpose. More specifically, a therapeutically effective amount 
means an amount effective to prevent development of or to alleviate the existing 
symptoms of the subject being treated. Determination of the effective amount is well 
within the capability of those skilled in the art, especially in light of the detailed 
disclosure provided herein. For any compound used in the method of the invention, the 
therapeutically effective dose can be estimated initially from appropriate in vitro assays. 
For example, a dose can be formulated in animal models to achieve a circulating 
concentration range that can be used to more accurately determine useful doses in 
humans. For example, a dose can be formulated in animal models to achieve a 
circulating concentration range that includes the ICso as determined in cell culture (i.e. , 
the concentration of the test compound which achieves a half-maximal inhibition of the 
protein's biological activity). Such information can be used to more accurately 
determine useful doses in humans. 

A therapeutically effective dose refers to that amount of the compound that 
results in amelioration of symptoms or a prolongation of survival in a patient. Toxicity 
and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio between 
LD50 and ED50. Compounds which exhibit high therapeutic indices are preferred. The 
data obtained from these cell culture assays and animal studies can be used in 
formulating a range of dosage for use in human. The dosage of such compounds lies 
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preferably within a range of circulating concentrations that include the ED50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual physician in view of the 
5 patient's condition. See, e.g. , Fingl et al., 1975, in "The Pharmacological Basis of 
Therapeutics", Ch. 1 p. 1. Dosage amount and interval may be adjusted individually to 
provide plasma levels of the active moiety which are sufficient to maintain the desired 
effects, or minimal effective concentration (MEC). The MEC will vary for each 
compound but can be estimated from in vitro data. Dosages necessary to achieve the 
10 MEC will depend on individual characteristics and route of administration. However, 
HPLC assays or bioassays can be used to determine plasma concentrations. 

Dosage intervals can also be determined using MEC value. Compounds should 
be administered using a regimen which maintains plasma levels above the MEC for 
10-90% of the time, preferably between 30-90% and most preferably between 50-90%. 
15 In cases of local administration or selective uptake, the effective local concentration of 
the drug may not be related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the 
invention will be in the range of about 0.01 (ag/kg to 100 mg/kg of body weight daily, 
with the preferred dose being about 0.1 |^g/kg to 25 mg/kg of patient body weight 
20 daily, varying in adults and children. Dosing may be once daily, or equivalent doses 
may be delivered at longer or shorter intervals. 

The amount of composition administered will, of course, be dependent on the 
subject being treated, on the subject's age and weight, the severity of the affliction, the 
manner of administration and the judgment of the prescribing physician. 

25 

4.9.4 PACKAGING 

The compositions may, if desired, be presented in a pack or dispenser device 
which may contain one or more unit dosage forms containing the active ingredient. 
The pack may, for example, comprise metal or plastic foil, such as a blister pack. The 
30 pack or dispenser device may be accompanied by instructions for administration. 
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Compositions comprising a compound of the invention formulated in a compatible 
pharmaceutical carrier may also be prepared, placed in an appropriate container, and 
labeled for treatment of an indicated condition. 

5 4.10 ANTIBODIES 

Another aspect of the invention is an antibody that specifically binds the 
polypeptide of the invention. Such antibodies include monoclonal and polyclonal 
antibodies, single chain antibodies, chimeric antibodies, bifunctional/bispecific 
antibodies, humanized antibodies, human antibodies, and complementary determining 

10 region (CDR) -grafted antibodies, including compounds which include CDR and/or 

antigen-binding sequences, which specifically recognize a polypeptide of the invention. 
Preferred antibodies of the invention are human antibodies which are produced and 
identified according to methods described in W093/11236, published June 20, 1993, 
which is incorporated herein by reference in its entirety. Antibody fragments, 

15 including Fab, Fab', F(ab")2, and F v , are also provided by the invention. The term 
"specific for" indicates that the variable regions of the antibodies of the invention 
recognize and bind polypeptides of the invention exclusively (i.e. , able to distinguish 
the polypeptide of the invention from other similar polypeptides despite sequence 
identity, homology, or similarity found in the family of polypeptides), but may also 

20 interact with other proteins (for example, S. aureus protein A or other antibodies in 
ELISA techniques) through interactions with sequences outside the variable region of 
the antibodies, and in particular, in the constant region of the molecule. Screening 
assays to determine binding specificity of an antibody of the invention are well known 
and routinely practiced in the art. For a comprehensive discussion of such assays, see 

25 Harlow et al. (Eds), Antibodies A Laboratory Manual; Cold Spring Harbor Laboratory; 
Cold Spring Harbor , NY (1988), Chapter 6. Antibodies that recognize and bind 
fragments of the polypeptides of the invention are also contemplated, provided that the 
antibodies are first and foremost specific for, as defined above, full length polypeptides 
of the invention. As with antibodies that are specific for full length polypeptides of the 

30 invention, antibodies of the invention that recognize fragments are those which can 
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distinguish polypeptides from the same family of polypeptides despite inherent sequence 
identity, homology, or similarity found in the family of proteins. Antibodies of the 
invention can be produced using any method well known and routinely practiced in the 
art. 

5 Non-human antibodies may be humanized by any methods known in the art. In 

one method, the non-human CDRs are inserted into a human antibody or consensus 
antibody framework sequence. Further changes can then be introduced into the 
antibody framework to modulate affinity or immunogenicity. 

Antibodies of the invention are useful for, for example, therapeutic purposes (by 

10 modulating activity of a polypeptide of the invention), diagnostic purposes to detect or 
quantitate a polypeptide of the invention, as well as purification of a polypeptide of the 
invention. Kits comprising an antibody of the invention for any of the purposes 
described herein are also comprehended. In general, a kit of the invention also includes 
a control antigen for which the antibody is immunospecific. The invention further 

15 provides a hybridoma that produces an antibody according to the invention. Antibodies 
of the invention are useful for detection and/or purification of the polypeptides of the 
invention. 

Polypeptides of the invention may also be used to immunize animals to obtain 
polyclonal and monoclonal antibodies which specifically react with the protein. Such 

20 antibodies may be obtained using either the entire protein or fragments thereof as an 
immunogen. The peptide immunogens additionally may contain a cysteine residue at 
the carboxyl terminus, and are conjugated to a hapten such as keyhole limpet 
hemocyanin (KLH). Methods for synthesizing such peptides are known in the art, for 
example, as in R. P. Merrifield, J. Amer. Chem. Soc. 85, 2149-2154 (1963); J. L 

25 Krstenansky, et al., FEBS Lett. 211, 10 (1987). 

Monoclonal antibodies binding to the protein of the invention may be useful 
diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 

30 abnormal expression of the protein is involved. In the case of cancerous cells or 
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leukemic cells, neutralizing monoclonal antibodies against the protein may be useful in 
detecting and preventing the metastatic spread of the cancerous cells, which may be 
mediated by the protein. In general, techniques for preparing polyclonal and 
monoclonal antibodies as well as hybridomas capable of producing the desired antibody 
5 are well known in the art (Campbell, A.M., Monoclonal Antibodies Technology: 
Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science 
Publishers, Amsterdam, The Netherlands (1984); St. Groth et al., J. Immunol. 35:1-21 
(1990); Kohler and Milstein, Nature 256:495-497 (1975)), the trioma technique, the 
human B-cell hybridoma technique (Kozbor et al., Immunology Today 4:72 (1983); 
10 Cole et al., in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985), 
pp. 77-96). 

Any animal (mouse, rabbit, etc.) which is known to produce antibodies can be 
immunized with a peptide or polypeptide of the invention. Methods for immunization 
are well known in the art. Such methods include subcutaneous or intraperitoneal 

15 injection of the polypeptide. One skilled in the art will recognize that the amount of the 
protein encoded by the ORF of the present invention used for immunization will vary 
based on the animal which is immunized, the antigenicity of the peptide and the site of 
injection. The protein that is used as an immunogen may be modified or administered 
in an adjuvant in order to increase the protein's antigenicity. Methods of increasing the 

20 antigenicity of a protein are well known in the art and include, but are not limited to, 
coupling the antigen with a heterologous protein (such as globulin or -galactosidase) 
or through the inclusion of an adjuvant during immunization. 

For monoclonal antibodies, spleen cells from the immunized animals are 
removed, fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, and allowed 

25 to become monoclonal antibody producing hybridoma cells. Any one of a number of 
methods well known in the art can be used to identify the hybridoma cell which 
produces an antibody with the desired characteristics. These include screening the 
hybridomas with an ELISA assay, Western blot analysis, or radioimmunoassay (Lutz et 
al., Exp. Cell Research. 175:109-124 (1988)). Hybridomas secreting the desired 

30 antibodies are cloned and the class and subclass is determined using procedures known 
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in the art (Campbell, A.M., Monoclonal Antibody Technology: Laboratory 
Techniques in Biochemistry and Molecular Biology, Elsevier Science Publishers, 
Amsterdam, The Netherlands (1984)). Techniques described for the production of 
single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce single chain 
5 antibodies to proteins of the present invention. 

For polyclonal antibodies, antibody-containing antiserum is isolated from the 
immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. The present invention further 
provides the above- described antibodies in delectably labeled form. Antibodies can be 

10 delectably labeled through the use of radioisotopes, affinity labels (such as biotin, 

avidin, etc.), enzymatic labels (such as horseradish peroxidase, alkaline phosphatase, 
etc.) fluorescent labels (such as FITC or rhodamine, etc.), paramagnetic atoms, etc. 
Procedures for accomplishing such labeling are well-known in the art, for example, see 
(Sternberger, L.A. et al., J. Histochem. Cytochem. 18:315 (1970); Bayer, E.A. et aL, 

15 Meth. Enzym. 62:308 (1979); Engval, E. et al., Immunol. 109:129 (1972); Goding, 
J.W. J. Immunol. Meth. 13:215 (1976)). 

The labeled antibodies of the present invention can be used for in vitro, in vivo, 
and in situ assays to identify cells or tissues in which a fragment of the polypeptide of 
interest is expressed. The antibodies may also be used directly in therapies or other 

20 diagnostics. The present invention further provides the above-described antibodies 

immobilized on a solid support. Examples of such solid supports include plastics such 
as polycarbonate, complex carbohydrates such as agarose and Sepharose®, acrylic 
resins and such as polyacrylamide and latex beads. Techniques for coupling antibodies 
to such solid supports are well known in the art (Weir, D.M. et al., "Handbook of 

25 Experimental Immunology" 4th Ed., Blackwell Scientific Publications, Oxford, 

England, Chapter 10 (1986); Jacoby, W.D. et al., Meth. Enzym. 34 Academic Press, 
N.Y. (1974)). The immobilized antibodies of the present invention can be used for in 
vitro, in vivo, and in situ assays as well as for immuno-affinity purification of the 
proteins of the present invention. 

30 
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4.11 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present 
invention can be recorded on computer readable media. As used herein, "computer 
readable media" refers to any medium which can be read and accessed directly by a 
5 computer. Such media include, but are not limited to: magnetic storage media, such as 
floppy discs, hard disc storage medium, and magnetic tape; optical storage media such 
as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these 
categories such as magnetic/optical storage media. A skilled artisan can readily 
appreciate how any of the presently known computer readable mediums can be used to 
10 create a manufacture comprising computer readable medium having recorded thereon a 
nucleotide sequence of the present invention. As used herein, "recorded" refers to a 
;\ process for storing information on computer readable medium. A skilled artisan can 
I readily adopt any of the presently known methods for recording information on 

I ; computer readable medium to generate manufactures comprising the nucleotide 
] ,i 15 sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating 

I I a computer readable medium having recorded thereon a nucleotide sequence of the 
present invention. The choice of the data storage structure will generally be based on 

' the means chosen to access the stored information. In addition, a variety of data 
J:;; 20 processor programs and formats can be used to store the nucleotide sequence 

information of the present invention on computer readable medium. The sequence 
information can be represented in a word processing text file, formatted in 
commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, 
25 Sybase, Oracle, or the like. A skilled artisan can readily adapt any number of data 
processor structuring formats (e.g. text file or database) in order to obtain computer 
readable medium having recorded thereon the nucleotide sequence information of the 
present invention. 

By providing any of the nucleotide sequences SEQ ID NOs: 1 - 419 or a 
30 representative fragment thereof; or a nucleotide sequence at least 95% identical to any 
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of the nucleotide sequences of the SEQ ID NOs: 1 - 419 in computer readable form, a 
skilled artisan can routinely access the sequence information for a variety of purposes. 
Computer software is publicly available which allows a skilled artisan to access 
sequence information provided in a computer readable medium. The examples which 
5 follow demonstrate how software which implements the BLAST (Altschul et ak, J. 

Mok Biok 215:403-410 (1990)) and BLAZE (Brutlag et ak, Comp. Chem. 17:203-207 
(1993)) search algorithms on a Sybase system is used to identify open reading frames 
(ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments 
and may be useful in producing commercially important proteins such as enzymes used 

10 in fermentation reactions and in the production of commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, 
software means, and data storage means used to analyze the nucleotide sequence 
information of the present invention. The minimum hardware means of the 
computer-based systems of the present invention comprises a central processing unit 

15 (CPU), input means, output means, and data storage means. A skilled artisan can 
readily appreciate that any one of the currently available computer-based systems are 
suitable for use in the present invention. As stated above, the computer-based systems 
of the present invention comprise a data storage means having stored therein a 
nucleotide sequence of the present invention and the necessary hardware means and 

20 software means for supporting and implementing a search means. As used herein, 
"data storage means" refers to memory which can store nucleotide sequence 
information of the present invention, or a memory access means which can access 
manufactures having recorded thereon the nucleotide sequence information of the 
present invention. 

25 As used herein, "search means" refers to one or more programs which are 

implemented on the computer-based system to compare a target sequence or target 
structural motif with the sequence information stored within the data storage means. 
Search means are used to identify fragments or regions of a known sequence which 
match a particular target sequence or target motif. A variety of known algorithms are 

30 disclosed publicly and a variety of commercially available software for conducting 
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search means are and can be used in the computer-based systems of the present 
invention. Examples of such software includes, but is not limited to, Smith- Waterman, 
MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA) . A skilled artisan 
can readily recognize that any one of the available algorithms or implementing software 
5 packages for conducting homology searches can be adapted for use in the present 

computer-based systems. As used herein, a "target sequence" can be any nucleic acid 
or amino acid sequence of six or more nucleotides or two or more amino acids. A 
skilled artisan can readily recognize that the longer a target sequence is, the less likely a 
target sequence will be present as a random occurrence in the database. The most 
10 preferred sequence length of a target sequence is from about 10 to 300 amino acids, 
more preferably from about 30 to 100 nucleotide residues. However, it is well 
recognized that searches for commercially important fragments, such as sequence 
fragments involved in gene expression and protein processing, may be of shorter 
length. 

15 As used herein, "a target structural motif," or "target motif," refers to any 

rationally selected sequence or combination of sequences in which the sequence(s) are 
chosen based on a three-dimensional configuration which is formed upon the folding of 
the target motif. There are a variety of target motifs known in the art. Protein target 
motifs include, but are not limited to, enzyme active sites and signal sequences. 

20 Nucleic acid target motifs include, but are not limited to, promoter sequences, hairpin 
structures and inducible expression elements (protein binding sequences). 

4.12 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be 
25 used to control gene expression through triple helix formation or antisense DNA or 

RNA, both of which methods are based on the binding of a polynucleotide sequence to 
DNA or RNA. Polynucleotides suitable for use in these methods are preferably 20 to 
40 bases in length and are designed to be complementary to a region of the gene 
involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); 
30 Cooney et al., Science 15241:456 (1988); and Dervan et al., Science 251:1360 (1991)) 
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or to the mRNA itself (antisense - Olmno, J. Neurochem. 56:560 (1991); 
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA 
transcription from DNA, while antisense RNA hybridization blocks translation of an 
5 mRNA molecule into polypeptide. Both techniques have been demonstrated to be 
effective in model systems. Information contained in the sequences of the present 
invention is necessary for the design of an antisense or triple helix oligonucleotide. 



4.13 DIAGNOSTIC ASSAYS AND KITS 

10 The present invention further provides methods to identify the presence or 

expression of one of the ORFs of the present invention, or homolog thereof, in a test 
sample, using a nucleic acid probe or antibodies of the present invention, optionally 
conjugated or otherwise associated with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
15 contacting a sample with a compound that binds to and forms a complex with the 

polynucleotide for a period sufficient to form the complex, and detecting the complex, 
so that if a complex is detected, a polynucleotide of the invention is detected in the 
sample. Such methods can also comprise contacting a sample under stringent 
hybridization conditions with nucleic acid primers that anneal to a polynucleotide of the 
20 invention under such conditions, and amplifying annealed polynucleotides, so that if a 
polynucleotide is amplified, a polynucleotide of the invention is detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the 
polypeptide for a period sufficient to form the complex, and detecting the complex, so 
25 that if a complex is detected, a polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of 
the antibodies or one or more of the nucleic acid probes of the present invention and 
assaying for binding of the nucleic acid probes or antibodies to components within the 
test sample. 
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Conditions for incubating a nucleic acid probe or antibody with a test sample 
vary. Incubation conditions depend on the format employed in the assay, the detection 
methods employed, and the type and nature of the nucleic acid probe or antibody used 
in the assay. One skilled in the art will recognize that any one of the commonly 
5 available hybridization, amplification or immunological assay formats can readily be 
adapted to employ the nucleic acid probes or antibodies of the present invention. 
Examples of such assays can be found in Chard, T., An Introduction to 
Radioimmunoassay and Related Techniques, Elsevier Science Publishers, Amsterdam, 
The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, 

10 Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., 
Practice and Theory of immunoassays: Laboratory Techniques in Biochemistry and 
Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1985). 
The test samples of the present invention include cells, protein or membrane extracts of 
cells, or biological fluids such as sputum, blood, serum, plasma, or urine. The test 

15 sample used in the above-described method will vary based on the assay format, nature 
of the detection method and the tissues, cells or extracts used as the sample to be 
assayed. Methods for preparing protein extracts or membrane extracts of cells are well 
known in the art and can be readily be adapted in order to obtain a sample which is 
compatible with the system utilized. 

20 In another embodiment of the present invention, kits are provided which contain 

the necessary reagents to carry out the assays of the present invention. Specifically, 
the invention provides a compartment kit to receive, in close confinement, one or more 
containers which comprises: (a) a first container comprising one of the probes or 
antibodies of the present invention; and (b) one or more other containers comprising 

25 one or more of the following: wash reagents, reagents capable of detecting presence of 
a bound probe or antibody. 

In detail, a compartment kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers 
or strips of plastic or paper. Such containers allows one to efficiently transfer reagents 

30 from one compartment to another compartment such that the samples and reagents are 
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not cross-contaminated, and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the antibodies 
used in the assay, containers which contain wash reagents (such as phosphate buffered 
5 saline, Tris-buffers, etc.), and containers which contain the reagents used to detect the 
bound antibody or probe. Types of detection reagents include labeled nucleic acid 
probes, labeled secondary antibodies, or in the alternative, if the primary antibody is 
labeled, the enzymatic, or antibody binding reagents which are capable of reacting with 
the labeled antibody. One skilled in the art will readily recognize that the disclosed 
10 probes and antibodies of the present invention can be readily incorporated into one of 
the established kit formats which are well known in the art. 

4.14 MEDICAL IMAGING 

The novel polypeptides and binding partners of the invention are useful in 
15 medical imaging of sites expressing the molecules of the invention (e.g. , where the 
polypeptide of the invention is involved in the immune response, for imaging sites of 
inflammation or infection). See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778. Such 
methods involve chemical attachment of a labeling or imaging agent, administration of 
the labeled polypeptide to a subject in a pharmaceutical^ acceptable carrier, and 
20 imaging the labeled polypeptide in vivo at the target site. 

4.15 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present 
invention further provides methods of obtaining and identifying agents which bind to a 
25 polypeptide encoded by an ORF corresponding to any of the nucleotide sequences set 
forth in the SEQ ID NOs: 1 - 419, or bind to a specific domain of the polypeptide 
encoded by the nucleic acid. In detail, said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the 
present invention, or nucleic acid of the invention; and 
30 (b) determining whether the agent binds to said protein or said nucleic acid. 
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In general, therefore, such methods for identifying compounds that bind to a 
polynucleotide of the invention can comprise contacting a compound with a 
polynucleotide of the invention for a time sufficient to form a polynucleotide/compound 
complex, and detecting the complex, so that if a polynucleotide/compound complex is 
5 detected, a compound that binds to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that 
bind to a polypeptide of the invention can comprise contacting a compound with a 
polypeptide of the invention for a time sufficient to form a polypeptide/compound 
complex, and detecting the complex, so that if a polypeptide/compound complex is 
10 detected, a compound that binds to a polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention 
can also comprise contacting a compound with a polypeptide of the invention in a cell 
for a time sufficient to form a polypeptide/compound complex, wherein the complex 
drives expression of a receptor gene sequence in the cell, and detecting the complex by 
15 detecting reporter gene sequence expression, so that if a polypeptide/compound 

complex is detected, a compound that binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate 
the activity of a polypeptide of the invention (that is, increase or decrease its activity, 
relative to activity observed in the absence of the compound). Alternatively, 
20 compounds identified via such methods can include compounds which modulate the 

expression of a polynucleotide of the invention (that is, increase or decrease expression 
relative to expression levels observed in the absence of the compound). Compounds, 
such as compounds identified via the methods of the invention, can be tested using 
standard assays well known to those of skill in the art for their ability to modulate 
25 activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be 
selected and screened at random or rationally selected or designed using protein 
modeling techniques. 
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For random screening, agents such as peptides, carbohydrates, pharmaceutical 
agents and the like are selected at random and are assayed for their ability to bind to the 
protein encoded by the ORF of the present invention. Alternatively, agents may be 
rationally selected or designed. As used herein, an agent is said to be "rationally 
5 selected or designed" when the agent is chosen based on the configuration of the 
particular protein. For example, one skilled in the art can readily adapt currently 
available procedures to generate peptides, pharmaceutical agents and the like, capable 
of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: 

10 Antisense Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY 
(1992), pp. 289-307, and Kaspczak et al., Biochemistry 28:9230-8 (1989), or 
pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present invention, as 
broadly described, can be used to control gene expression through binding to one of the 

15 ORFs or EMFs of the present invention. As described above, such agents can be 

randomly screened or rationally designed/selected. Targeting the ORF or EMF allows 
a skilled artisan to design sequence specific or element specific agents, modulating the 
expression of either a single ORF or multiple ORFs which rely on the same EMF for 
expression control. One class of DNA binding agents are agents which contain base 

20 residues which hybridize or form a triple helix formation by binding to DNA or RNA. 
Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or 
can be a variety of sulfhydryl or polymeric derivatives which have base attachment 
capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and 
25 are designed to be complementary to a region of the gene involved in transcription 
(triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 
241:456 (1988); and Dervan et aL, Science 251:1360 (1991)) or to the mRNA itself 
(antisense - Okano, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense 
Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple 
30 helix-formation optimally results in a shut-off of RNA transcription from DNA, while 
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antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. 
Both techniques have been demonstrated to be effective in model systems. Information 
contained in the sequences of the present invention is necessary for the design of an 
antisense or triple helix oligonucleotide and other DNA binding agents. 
5 Agents which bind to a protein encoded by one of the ORFs of the present 

invention can be used as a diagnostic agent. Agents which bind to a protein encoded by 
one of the ORFs of the present invention can be formulated using known techniques to 
generate a pharmaceutical composition. 

10 4.16 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific 
nucleic acid hybridization probes capable of hybridizing with naturally occurring 
nucleotide sequences. The hybridization probes of the subject invention may be derived 
from any of the nucleotide sequences SEQ ID NOs: 1 - 419. Because the 

15 corresponding gene is only expressed in a limited number of tissues, a hybridization 
probe derived from of any of the nucleotide sequences SEQ ID NOs: 1 - 419 can be 
used as an indicator of the presence of RNA of cell type of such a tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in 
situ hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 

20 provides additional uses for oligonucleotides based upon the nucleotide sequences. 
Such probes used in PCR may be of recombinant origin, may be chemically 
synthesized, or a mixture of both. The probe will comprise a discrete nucleotide 
sequence for the detection of identical sequences or a degenerate pool of possible 
sequences for identification of closely related genomic sequences. 

25 Other means for producing specific hybridization probes for nucleic acids 

include the cloning of nucleic acid sequences into vectors for the production of mRNA 
probes. Such vectors are known in the art and are commercially available and may be 
used to synthesize RNA probes in vitro by means of the addition of the appropriate 
RNA polymerase as T7 or SP6 RNA polymerase and the appropriate radioactively 

30 labeled nucleotides. The nucleotide sequences may be used to construct hybridization 
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probes for mapping their respective genomic sequences. The nucleotide sequence 
provided herein may be mapped to a chromosome or specific regions of a chromosome 
using well known genetic and/or chromosomal mapping techniques. These techniques 
include in situ hybridization, linkage analysis against known chromosomal markers, 
5 hybridization screening with libraries or flow-sorted chromosomal preparations specific 
to known chromosomes, and the like. The technique of fluorescent in situ hybridization 
of chromosome spreads has been described, among other places, in Verma et al (1988) 
Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York 
NY. 

10 Fluorescent in situ hybridization of chromosomal preparations and other 

physical chromosome mapping techniques may be correlated with additional genetic 
map data. Examples of genetic map data can be found in the 1994 Genome Issue of 
Science (265: 1981f). Correlation between the location of a nucleic acid on a physical 
chromosomal map and a specific disease (or predisposition to a specific disease) may 

15 help delimit the region of DNA associated with that genetic disease. The nucleotide 
sequences of the subject invention may be used to detect differences in gene sequences 
between normal, carrier or affected individuals. 

4.17 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, 
20 for example, directly synthesizing the oligonucleotide by chemical means, as is commonly 

practiced using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to 

those of skill in the art using any suitable support such as glass, polystyrene or Teflon. 

One strategy is to precisely spot oligonucleotides synthesized by standard synthesizers. 
25 Immobilization can be achieved using passive adsorption (Inouye & Hondo, (1990) J . 

Clin. Microbiol. 28(6) 1469-72); using UV light (Nagata et al, 1985; Dahlen et al, 

1987; Morrissey & Collins, (1989) Mol. Cell Probes 3(2) 189-207) or by covalent binding 

of base modified DNA (Keller et al, 1988; 1989); all references being specifically 

incorporated herein. 
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Another strategy that may be employed is the use of the strong biotin-streptavidin 
interaction as a linker. For example, Broude et al (1994) Proc. Natl. Acad. ScL USA 
91(8) 3072-6, describe the use of biotinylated probes, although these are duplex probes, 
that are immobilized on streptavidin-coated magnetic beads. Streptavidin-coated beads 

5 may be purchased from Dynal, Oslo. Of course, this same linking chemistry is applicable 
to coating any surface with streptavidin. Biotinylated probes may be purchased from 
various sources, such as, e.g., Operon Technologies (Alameda, CA). 

Nunc Laboratories (Naperville, IL) is also selling suitable material that could be 
used. Nunc Laboratories have developed a method by which DNA can be covalently 

10 bound to the microwell surface termed Covalink NH. CovaLink NH is a polystyrene 
surface grafted with secondary amino groups ( > NH) that serve as bridge-heads for 
further covalent coupling. CovaLink Modules may be purchased from Nunc 
Laboratories. DNA molecules may be bound to CovaLink exclusively at the 5 '-end by a 
phosphoramidate bond, allowing immobilization of more than 1 pmol of DNA 

15 (Rasmussen et a/., (1991) Anal. Biochem. 198(1) 138-42). 

The use of CovaLink NH strips for covalent binding of DNA molecules at the 
5*-end has been described (Rasmussen et al. , (1991). In this technology, a 
phosphoramidate bond is employed (Chu et al., (1983) Nucleic Acids Res. 11(8) 6513- 
29). This is beneficial as immobilization using only a single covalent bond is preferred. 

20 The phosphoramidate bond joins the DNA to the CovaLink NH secondary amino groups 
that are positioned at the end of spacer arms covalently grafted onto the polystyrene 
surface through a 2 nm long spacer arm. To link an oligonucleotide to CovaLink NH via 
an phosphoramidate bond, the oligonucleotide terminus must have a 5'-end phosphate 
group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and 

25 then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 
ng/ul) and denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0. 1 M 
1-methylimidazole, pH 7.0 (l-Melm?), is then added to a final concentration of 10 mM 
l-Melm?. A ss DNA solution is then dispensed into CovaLink NH strips (75 ul/well) 

30 standing on ice. 

787CIP2B 102 



Carbodiimide 0.2 M l-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), 
dissolved in 10 mM l-Melm?, is made fresh and 25 ul added per well. The strips are 
incubated for 5 hours at 50°C After incubation the strips are washed using, e.g. , 
Nunc-Immuno Wash; first the wells are washed 3 times, then they are soaked with 
5 washing solution for 5 min., and finally they are washed 3 times (where in the washing 
solution is 0.4 N NaOH, 0.25% SDS heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention 
is that described in PCT Patent Application WO 90/03382 (Southern & Maskos), 
incorporated herein by reference. This method of preparing an oligonucleotide bound to a 

10 support involves attaching a nucleoside 3 '-reagent through the phosphate group by a 
covalent phosphodiester link to aliphatic hydroxyl groups carried by the support. The 
oligonucleotide is then synthesized on the supported nucleoside and protecting groups 
removed from the synthetic oligonucleotide chain under standard conditions that do not 
cleave the oligonucleotide from the support. Suitable reagents include nucleoside 

15 phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA 
probe arrays may be employed. For example, addressable laser-activated 
photodeprotection may be employed in the chemical synthesis of oligonucleotides directly 
on a glass surface, as described by Fodor et al (1991) Science 251(4995) 767-73, 

20 incorporated herein by reference. Probes may also be immobilized on nylon supports as 
described by Van Ness et al (1991) Nucleic Acids Res. 19(12) 3345-50; or linked to 
Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 169(1) 104-8; all 
references being specifically incorporated herein. 

To link an oligonucleotide to a nylon support, as described by Van Ness et al 

25 (1991), requires activation of the nylon surface via alkylation and selective activation of 
the 5' -amine of oligonucleotides with cyanuric chloride. 

One particular way to prepare support bound oligonucleotides is to utilize the 
light-generated synthesis described by Pease et al, (1994) PNAS USA 91(11) 5022-6, 
incorporated herein by reference). These authors used current photolithographic 

30 techniques to generate arrays of immobilized oligonucleotide probes (DNA chips). These 
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methods, in which light is used to direct the synthesis of oligonucleotide probes in 
high-density, miniaturized arrays, utilize photolabile 5'-protected iV-acyl-deoxynucleoside 
phosphoramidites, surface linker chemistry and versatile combinatorial synthesis 
strategies. A matrix of 256 spatially defined oligonucleotide probes may be generated in 
5 this manner. 

4.18 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, 
genomic DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC 
inserts, and RNA, including mRNA without any amplification steps. For example, 

10 Sambrook et al (1989) describes three protocols for the isolation of high molecular weight 
DNA from mammalian cells (p. 9.14-9.23). 

DNA fragments may be prepared as clones in M13, plasmid or lambda vectors 
and/or prepared directly from genomic DNA or cDNA by PCR or other amplification 
methods. Samples may be prepared or dispensed in multiwell plates. About 100-1000 ng 

15 of DNA samples may be prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those 
of skill in the art including, for example, using restriction enzymes as described at 
9.24-9.28 of Sambrook et al (1989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al (1990) 

20 Nucleic Acids Res. 18(24) 7455-6, incorporated herein by reference). In this method, 
DNA samples are passed through a small French pressure cell at a variety of low to 
intermediate pressures. A lever device allows controlled application of low to 
intermediate pressures to the cell. The results of these studies indicate that low-pressure 
shearing is a useful alternative to sonic and enzymatic DNA fragmentation methods. 

25 One particularly suitable way for fragmenting DNA is contemplated to be that 

using the two base recognition endonuclease, Cv/JI, described by Fitzgerald et al. (1992) 
Nucleic Acids Res. 20(14) 3753-62. These authors described an approach for the rapid 
fragmentation and fractionation of DNA into particular sizes that they contemplated to be 
suitable for shotgun cloning and sequencing. 
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The restriction endonuclease CviJl normally cleaves the recognition sequence 
PuGCPy between the G and C to leave blunt ends. Atypical reaction conditions, which 
alter the specificity of this enzyme (Cv/JI**), yield a quasi-random distribution of DNA 
fragments form the small molecule pUC19 (2688 base pairs), Fitzgerald et al (1992) 
5 quantitatively evaluated the randomness of this fragmentation strategy, using a CV/JI** 
digest of pUC19 that was size fractionated by a rapid gel filtration method and directly 
ligated, without end repair, to a lac Z minus M13 cloning vector. Sequence analysis of 76 
clones showed that CwJI** restricts pyGCPy and PuGCPu, in addition to PuGCPy sites, 
and that new sequence data is accumulated at a rate consistent with random fragmentation. 
10 As reported in the literature, advantages of this approach compared to sonication 

and agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug 
D instead of 2-5 ug); and fewer steps are involved (no preligation, end repair, chemical 
- ;j extraction, or agarose gel electrophoresis and elution are needed 
■ ^ Irrespective of the manner in which the nucleic acid fragments are obtained or 

; j 15 prepared, it is important to denature the DNA to give single stranded pieces available for 
I rt hybridization. This is achieved by incubating the DNA solution for 2-5 minutes at 

80-90°C. The solution is then cooled quickly to 2°C to prevent renaturation of the DNA 
I fragments before they are contacted with the chip. Phosphate groups must also be 
removed from genomic DNA by methods known in the art. 

^ 20 4.19 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon 
membrane. Spotting may be performed by using arrays of metal pins (the positions of 
which correspond to an array of wells in a microtiter plate) to repeated by transfer of 
about 20 nl of a DNA solution to a nylon membrane. By offset printing, a density of dots 
25 higher than the density of the wells is achieved. One to 25 dots may be accommodated in 
1 mm 2 , depending on the type of label used. By avoiding spotting in some preselected 
number of rows and columns, separate subsets (subarrays) may be formed. Samples in one 
subarray may be the same genomic segment of DNA (or the same gene) from different 
individuals, or may be different, overlapped genomic clones. Each of the subarrays may 
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represent replica spotting of the same samples. In one example, a selected gene segment 
may be amplified from 64 patients. For each patient, the amplified gene segment may be 
in one 96-well plate (all 96 wells containing the same sample). A plate for each of the 64 
patients is prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 
5 cm membrane. Subarrays may contain 64 samples, one from each patient. Where the 96 
subarrays are identical, the dot span may be 1 mm 2 and there may be a 1 mm space 
between subarrays. 

Another approach is to use membranes or plates (available from NUNC, 
Naperville, Illinois) which may be partitioned by physical spacers e.g. a plastic grid 

10 molded over the membrane, the grid being similar to the sort of membrane applied to the 
bottom of multiwell plates, or hydrophobic strips. A fixed physical spacer is not preferred 
for imaging by exposure to flat phosphor-storage screens or x-ray films. 

The present invention is illustrated in the following examples. Upon consideration 
of the present disclosure, one of skill in the art will appreciate that many other 

15 embodiments and variations may be made in the scope of the present invention. 

Accordingly, it is intended that the broader aspects of the present invention not be limited 
to the disclosure of the following examples. The present invention is not to be limited in 
scope by the exemplified embodiments which are intended as illustrations of single aspects 
of the invention, and compositions and methods which are functionally equivalent are 

20 within the scope of the invention. Indeed, numerous modifications and variations in the 
practice of the invention are expected to occur to those skilled in the art upon 
consideration of the present preferred embodiments. Consequently, the only limitations 
which should be placed upon the scope of the invention are those which appear in the 
appended claims. 

25 All references cited within the body of the instant specification are hereby 

incorporated by reference in their entirety. 

5.0 EXAMPLES 

5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 
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A plurality of novel nucleic acids were obtained from cDNA libraries prepared 
from various human tissues and in some cases isolated from a genomic library derived 
from human chromosome using standard PCR, SBH sequence signature analysis and 
Sanger sequencing techniques. The inserts of the library were amplified with PCR using 
5 primers specific for the vector sequences which flank the inserts. Clones from cDNA 
libraries were spotted on nylon membrane filters and screened with oligonucleotide probes 
(e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups of 
similar or identical sequences. Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a 
10 typical Sanger sequencing protocol. PCR products were purified and subjected to 

fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using a 
377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In 
some cases RACE (Random Amplification of cDNA Ends) was performed to further 
extend the sequence in the 5' direction. 

15 5.2 EXAMPLE 2 

Novel Nucleic Acids 

The novel nucleic acids of the present invention of the invention were assembled 
from sequences that were obtained from a cDNA library by methods described in 
Example 1 above, and in some cases sequences obtained from one or more public 

20 databases. The nucleic acids were assembled using an EST sequence as a seed. Then a 
recursive algorithm was used to extend the seed EST into an extended assemblage, by 
pulling additional sequences from different databases (i.e., Hyseq's database containing 
EST sequences, dbEST version 114, gb pri 114, and UniGene version 101) that belong to 
this assemblage. The algorithm terminated when there was no additional sequences from 

25 the above databases that would extend the assemblage. Inclusion of component sequences 
into the assemblage was based on a BLASTN hit to the extending assemblage with 
BLAST score greater than 300 and percent identity greater than 95 % . 

Using PHRAP (Univ. of Washington) or CAP4 (Paracel), a full length gene 
cDNA sequence and its corresponding protein sequence were generated from the 
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assemblage. Any frame shifts and incorrect stop codons were corrected by hand editing. 
During editing, the sequence was checked using FASTY and/or BLAST against Genbank 
(i.e., dbEST version 117, gb pri 117, UniGene version 117, Genepet release 117). Other 
computer programs which may have been used in the editing process were phredPhrap 
5 and Consed (University of Washington) and ed-ready, ed-ext and gc-zip-2 (Hyseq, Inc.). 
The full-length nucleotide and amino acid sequences, including splice variants resulting 
from these procedures are shown in the Sequence Listing as SEQ ID NOS: 1- 419. 

Table 1 shows the various tissue sources of SEQ ID NO: 1-419. 

The nearest neighbor results for SEQ ID NO: 1-419 were obtained by a 
10 BLASTP version 2.0al 19MP-WashU search against Genpept release 118, using 

BLAST algorithm. The nearest neighbor result showed the closest homologue for SEQ 
ID NO: 1-419 from Genpept (and contains the translated amino acid sequences for 
which the nucleic acid sequence encodes). The nearest neighbor results for SEQ ID 
NO: 1-419 are shown in Table 2 below. 

15 
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TABLE 1 



TISSUE ORIGIN 


RNA SOURCE 


HYSEQ LIBRARY 


SEQ ID NOS: 








NAME 




O.U.UX U 


.ora in 


GIBCO 


AB3001 


23 45 52 55 59 61 65-66 69 72 










75-76 80 105 123 130 133-134 










137 147 149 157-158 179 198 202 










207 212-213 233 246 253-254 259 










263 272-273 275 295 303 327 352 










364 389 391-392 406 


aHn 1 t" 


Kv>q t n 
Jw* i ct JLii. 


GIBCO 


ABDO03 


3 7 14 19 22-23 26 29 31-32 37 










43-45 48 50-51 55 58-59 62 65 










69-70 74 77 79-80 85-86 91 105 










111 113 115-116 123 133 135 










144-145 149-150 155 157-158 168 










179 186 191 198 210-211 213 221 










224 227-228 230-233 236-237 239 










242-243 246 248 252 259 261 










263-265 268-275 281-283 286 288 










303 311 316-317 321 329 332 334 










342-343 346 352 377 379 385 389 










395-396 400 


aclu.lt 


bi"3.in 


Clontech 


ABR0 01 


7 29 32 44-45 52 69 77-78 80 










110 191 222 233 236 257-258 262 










271 297-298 341 343 368 385 389 










393 401 407 


aClUX C 


JJ X ct _LX1 


Clontech 


ABR006 


3 10-12 23 33 39 43 45 80-81 










83-84 94 117 198 272 300 385 










387-389 398 413-414 417 


acLum 


, ' 

brain 


Clontech 


ABR0 0 8 


1 3 5-7 12 15 18-23 25 28-30 










32-35 37-43 47-48 51-52 54 58- 










61 63 67-70 72-73 75-76 79 82- 










86 92 94-99 101 105-106 109 111 










113 120 128 131-132 134 137 










139-147 154 156 159 165 168-171 










173 176-181 184 187-188 191-194 










197 200 202 204 210-211 215 218 










220 224 230-233 238-239 243 










247-248 252-259 262-263 265-266 










268-269 272-273 275 281-286 294 










296 298-301 303-305 308 310 










312-314 317-318 321 325 327 342 










346-352 354-355 361 364 368 370 










377 381-383 385 387-388 391-393 










398 400-403 406-408 410 414 










416-417 419 


adult 


brain 


Clontech 


ABR011 


85 


adult 


brain 


BioChain 


ABR012 


344 


adult 


brain 


Invitrogen 


ABR013 


74 141-143 385 


adult 


brain 


Invitrogen 


ABR014 


43 48 417 


adult 


brain 


Invitrogen 


ABR015 


391-392 417 


adult 


brain 


Invitrogen 


ABR016 


23 45 48 54 337 395-396 


adult 


brain 


Invitrogen 


ABT004 


20 23 37 40 43 48 50 58 60 78 










85-86 105 111 126 137 145 147 










159 161 164 188 191 194 198 208 










212 223 229 237 239 252-256 263 










271-272 275 291 295 299 334 368 
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adipocytes 



adrenal gland 



Strategene 



Clontech 



ADP001 



ADR002 



376 381 385 388-389 393 398 401 
406 419 



10-11 20 23 37 43 50 52 54 60 
69 86 102 115-116 119 123 127 
145 156-158 166 179 181-182 234 
238 243 253-254 259 261 268-272 
281-282 284 290 294-295 314 346 
382 399 414 418 



10-11 13 21 25-26 31 33 50-51 

54-55 65 69 80 86 93 95 97 106 

111 133 149 156 166 173 181-182 

188 194 203 210-211 213 238 248 
253-254 257-258 286 294 298 302 

311 317 327 346 356 383 393 398 
405 412 418 



adult heart 



GIBCO 



AHR001 



adult kidney 



GIBCO 



AKD001 



adult kidney 



Invitrogen 



AKT002 



adult lung 



GIBCO 



ALG0 01 



1 7 10-12 19 
45 47 50 52 
76 79-80 85 
119-123 130 
141-143 145 
157-158 163 
196-197 202 
227-228 236 
252-254 257 
272-273 277 
300 305 311 
344 346 350 
385 390 398 
414 418 



20 29 31-32 
54-55 59-65 
101-102 113- 
133 136-137 
148-149 154- 
172 178-179 
207 212-214 
238 240 243 
259 261-264 
279 288-289 
314 325 335- 
352 361 364 
400-401 403- 



37 43- 
72 74- 
114 
139 
155 
182 
222 
248 

268-270 
294-296 
336 341 
366 368 
406 412 



1 3-4 7 10 
29 31-32 37 
58-62 65 67 
91 101-103 
123 126 128 
141-144 147 
171 174 178 
191-192 196 
218-219 223 
235-240 244 
268-272 275 
285-287 290 
311 317-318 
341-344 346 
370 373-375 
400-401 404 



12 14 19-20 

44-48 50-52 
-69 74-77 79 
111 113-114 

130 132-134 
-149 153 155 
-179 181-182 
200-201 207 
-224 227-228 
-246 252-259 
277-279 281 
294-295 301 
329-330 332 
350 352 361 
377 382-385 
-406 414 418 



22 25-27 

54-55 
-80 89 
119 121- 

136-138 

159 166 

188 

209-213 
230-233 
261-264 
-282 
305 309 
334 

364 368 
391-394 



4 12 21 
75-76 80 
137-138 
169-170 
212 232 
274-275 
357 366 



29 32 45 48 

97 103 110 

144 147 153 

179 186 188- 

233 242 264 

284 308 341 

385 398 405- 



51 67-68 
120-123 
155 157-158 
190 194 196 
268-269 272 
343 346 352 
406 414 419 



2 10-11 
114 118 
157-158 
264 270 



37 50 59 69 
123 133 147 
166 179 181 
-271 274 277 



-70 75-76 80 

149 155 

223 242 246 
-279 289 311 
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lymph node 



young liver 



Clontech 



GIBCO 



ALN0 01 



ALV0 01 



332 334 337 342 370 384 3 94 



43 54 60 69 71 108 113 123 134 
152 155-156 181 212 222 232 270 
274 281-282 294 322 337 350 366 
368 380 400 



7 14 17 25 49 59-61 80 123 130- 

131 134 145 149 153-154 166 

169-171 173 179 181-182 196 200 

212 231 233 261-262 272 275 286 

295 344 361 364 368 373 405 408 
414 



adult liver 



Invitrogen 



ALVO02 



adult ovary 



Invitrogen 



AOV0 01 



adult placenta 



Clontech 



APL001 



3 13 17 21 25 47-48 51 75-76 88 
91 100 107 114 123 131 134 139 
155 164 174 176 194 196 201 217 
220 223-225 232 237 244-245 
255-256 259 261 272 281-282 
285-286 288 310 317 343 346 352 
376 379 398 404 416 



3-7 10-12 14 17 19-21 23 25 28- 
29 31-32 37 43-45 47-48 50-51 
54-55 58-61 65 67-70 72 74-82 
87 91-92 98-101 103 111 113 
115-116 118-120 123 128 131-133 
137 139 141-145 147 149-153 
155-158 160 164-167 169-173 
178-179 181-182 186 188-191 194 
200 204 207 209-214 218 220 
222-223 227-228 231-233 235-240 
242-243 246 248 251-256 259 
261-262 264-265 268-279 284-288 
290 294-295 298-301 303 305 307 
309 311 317 321-323 325 327 332 
334 337 341 344 346 350-352 
354-355 366 370 374-375 377 
382-385 391-397 400-403 405-406 
408 412 414 418 



29 32 100 123 128 149 227-228 
247 295 301 382 



placenta 



Invitrogen 



APL0 02 



adult spleen 



GIBCO 



ASP001 



adult testis 



GIBCO 



ATS001 



20 23 43 
253-258 



48 110 139 
261 271 352 



232 238 
398 



12 43 52 
113 121- 
163 166 
243 253 
294-295 
334-335 
368 373 



54-55 59-61 

123 131 137 

181 186 191 

254 261 264 

308 311 315 

337 342 344 

376 382 400 



64 80 108 

149 155 159 

210-212 239 

270 272 287 

327 329 332 

345 350 352 
401 



1 10-11 
61 79-80 
144-145 
194 197 
243 259 
294-295 
400-401 



14 29 32 37 
90 118-119 
149 155 169 
209 212 224 
267-269 271 
350 352 361 
418 



44 50 54-55 
123 128 
-170 182 192 
232 239-240 
274-275 289 
364 398 



Genomic DNA 
from BAC 63118 



Research 
Genetics (CITB 



BAC001 



164 
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BAC Library) 






Genomic DNA 


Research 


BAC0 02 


291 


from BAC 3 9316 


Genetics (CITB 








BAC Library) 






Genomic DNA 


Research 


BAC0 03 


291 


from BAC 3 9316 


Genetics (CITB 








BAC Library) 






adult bladder 


Invitrogen 


BLD001 


43 80 91 137 146 154 167 201 








238-239 272 287 317 329 368 








391-392 


bone marrow 


Clontech 


BMDOO 1 


1-3 7 12 14 16 26 31 37 43-46 








49 54 57 59-61 67-70 74-77 80 








82 84 91 98-99 102 104 108 113 








117-119 123 127-128 130 133 139 








145 153 155 157-158 160 168-170 








179 181 188 202 207-208 210-212 








222 230 232 236 242 247 250 259 








261 264-265 268-270 274 276-279 








286 289 293-295 300 311 317 325 








335 352 364 366 370 384 387 393 








395-396 398 414 


bone marrow 


Clontech 


BMD002 


1-4 7 10-11 13 16 19 21 36 43- 








44 48 52 54 58 60 98-99 108 110 








117 123 137-138 173 179 181 








230-232 242 253-254 262 268-269 








272-273 281-283 292 294 314 316 








347 370 383 387 393 398 405 408 








414 


bone marrow 


Clontech 


BMDOO 7 


419 


adult colon 


Invitrogen 


CLM0 01 


12 26 33 101 104 108 119 179 








232 253-254 270 300 340 376 381 








404 414 


Mixture of 16 


Various 


CTL016 


7 393 413 


tissues- 


Vendors* 






mRNAs * 








Mixture of 16 


Various 


CTL021 


117 176 


tissues- 


Vendors* 






mRNAs* 








adult cervix 


BioChain 


CVX0 01 


3 7 12 17 29 31-33 38 43 45 48 








50 54-56 59 65 67-70 77 79-80 








86 91 102-103 108 113 118 120- 








122 125 129 133 141-144 149 153 








155-158 165-166 175 179 181 194 



* The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 10) 
human leukemia lymphablastic mRNA (Clontech), 11) human thymus mRNA (Clontech), 12) human lymph node 
mRNA (Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15) human 
esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain). 
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199-200 212-214 218 227-228 








235-236 240 259 262 264 270 272 








274 279 281-282 285-288 293 295 








301 309-311 317-318 327 332 335 








342-343 346 353 356 361 366 








374-375 377 381 385 395-396 400 








403 405 408 410 414 416 


diaphragm 


BioChain 


DIA002 


80 212 


endothelial 


Strategene 


EDT0 01 


1 3-4 7 10-12 14 16 20-21 24 


cells 






28-29 32 38 43-44 47-52 54-55 








58-61 74-77 86 91-92 97 103 113 








115-116 123 128 130 139 141-147 








149 152 155-158 160 166 169-170 








172-173 179 181 186 189-191 207 








210-212 214 218-219 223 231-233 








236 238-239 246 253-259 261-262 








264 268-273 276 279 281-282 








284-288 294-295 298 301 303 305 








312 314-315 335 341 343-346 349 








351-352 361 365 368 370 373-375 








377 380 382 393 398 401-403 405 








414 418 


Genomic clones 


DNA from 


EPM0O1 


164 291 


from the short 


Genetic 






arm of 


Research 






chromosome 8 








esophagus 


BioChain 


ESO002 


20 123 


fetal brain 


Clontech 


FBR0O1 


82 95 152 


fetal brain 


Clontech 


FBR0O4 


33 81 134 150 198 304 400 407 








414 


fetal brain 


Clontech 


FBR00 6 


3-6 12 16-20 23 37 40 43-44 48 








51 54-55 59 63 69 75-76 85-86 








91 93 103 105-106 109 111 113 








119 129 134 141-143 156 159 165 








173 177 179-181 188-191 198 








202-203 210-211 230-232 238-239 








253-259 262 266 268-270 272-273 








275 283 287 293-294 298-300 








304-306 314-315 319-322 325 330 








344 368-369 383 388 393 405 407 








414 


fetal brain 


Clontech 


FBRS0 3 


166 288 344 391-392 


fetal brain 


Invitrogen 


FBT002 


16 23 31 43 48 50 52 61 64 69 








81 86 123 131 134 144 156 162 








166 176 178-179 188-191 197 201 








229 237-238 251 255-256 263 








268-269 272-273 275 285 294-295 








299 301 303 332 342 346 352 365 








368 395-396 398 401-402 406 


fetal heart 


Invitrogen 


FHR0O1 


43 80 294 


fetal kidney 


Clontech 


FKD0O1 


7 45 67-69 123 133 155 157-158 








166 169-170 181 196 202 207 216 








218 237 246 259 261 264 270 273 








277-278 294 332 387 398 


fetal kidney 


Clontech 


FKD002 


123 379 


fetal kidney 


Invitrogen 


FKD007 


81 
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fetal lung 


Clontech 


FLG001 


3 43 45 55 111 132-133 259 270 








404 


fetal lung 


Invitrogen 


FLG0 03 


17 23 31 43 47 75-76 80-81 91 








134 185 204-206 237 255-256 272 








275 287 294-295 314 330-331 377 








406 


fetal lung 


Clontech 


FLG004 


43 317 


fetal liver- 


Scares 


FLS001 


1 3-8 10-14 16-17 20-23 25-29 


spleen 






31-32 37-38 43-45 47-48 50-60 








62-63 65 67-70 74 77-79 81-82 








86 88 91-93 98-99 101 105-106 








110-119 121-123 127-128 130-131 








133-134 136 139-143 146-148 








153-156 160 163-164 166-170 








172-173 175 178 181 186 189-190 








196 204 207-208 212 225 227-228 








230-246 249-250 252-254 257-259 








261-264 268-275 277-279 281-282 








285-287 290 294-295 299 301 








307-309 316-318 325 327-328 332 








334 337 339 341 344 346-352 364 








366 368 370 372-379 382 385-386 








391-393 398 401-402 404-406 410 








412 414 


normalized 


Soares 


FLS002 


1 3-4 7 10-14 16-17 20 22-23 


fetal liver- 






25-26 31 37 43-45 47-48 50 54- 


spleen 






55 58-60 62 67-70 78 80 88-89 








91-93 100-101 106 111-112 115- 








117 119 123 126-128 130 132-133 








136-137 140 144 148 153 157-158 








165 168-170 173 175-177 179 181 








186 189-190 192 194-196 199-200 








202 204 209 213 217 224-225 








227-228 230-242 246 248 251 








253-254 259 261-262 264-265 








268-269 272-273 279 281-282 285 








288 290 292 294 296 299-300 307 








311-312 318-320 322-328 332 334 








337-338 341-343 346 348 350 361 








364 366 368 370-372 376-380 382 








384-387 391-392 394 398-399 








402-403 406 412 414 419 


fetal liver- 


Soares 


FLS003 


19 22-23 50 253-254 290 295 383 


spleen 






385 391-392 401 405 


fetal liver 


Invitrogen 


FLV001 


7 20 23 31 43 51 60-61 67-68 77 








79 91 102 166 217-218 230 232 








234 237 239 252-254 257-259 








268-269 275 285 290 294-296 298 








317-318 322 332 368 370 373 379 








398 401 404 


fetal liver 


Clontech 


FLV0 02 


67-68 283 


fetal liver 


Clontech 


FLV0 04 


4 16 23 25 196 268-269 272 298 








370 394 


fetal muscle 


Invitrogen 


FMS001 


23 52 55 58 106 110 134 154 166 








177 179 189-190 193 198 203 207 






1 229-230 233 253-254 259 290 
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294-295 307 317 352 368 383 390 
404 



fetal muscle 



Invitrogen 



FMS002 



10-11 80 123 154 231 290 295 
319-320 



fetal skin 



Invitrogen 



FSK001 



3-4 7 10-11 15 18 20-21 23 30- 
31 33 35 43 50 52 54 62 64 77 
80 86 89 109 115-116 121-122 
126 130 132 144 148 153 166 171 
181 185-186 188-190 194 205-207 
218 224 226-228 230 234-235 
237-239 242-243 253-254 257-259 
263 268-269 272 275 285 288 290 
294-295 299-300 317 326-327 329 
333-334 342 352 358-360 362-368 
377 381-383 391-393 398 401 405 
418 



fetal skin 



Invitrogen 



FSK002 



3 16 49 54 123 138 169-170 196 
207 210-211 235 246 383 393 401 
408 



fetal spleen 



BioChain 



FSP001 



212 



umbilical cord 



BioChain 



FUC001 



12 16 21 23 28-29 32-33 41-43 
46 48 51 54-55 59 74-76 78-79 
98-99 123 125 133 146 148 150 
153-154 159 164 166 181-182 188 
198 200 207 212 218 222 224 
227-228 231 236-237 248 253-254 
259 261 268-270 272 277-278 
285-288 290 294-295 311 316-317 
327 343-344 346-347 352 366 
368-370 373 377 379 382 391-392 
399-400 402 412 414 418 



fetal brain 



GIBCO 



HFB001 



1 3 7 12 14 
47 50 52 54 
76 80-81 86 
115-116 118 
137 139 141 
155-159 169 
194 198 201 
229 232-233 
253-259 261 
277-279 281 
295 299 301 
341-342 346 
377 379 384 
407 414 418 



19-21 23 31 43-45 
-55 58-61 63 65 74- 
91 94 102 105 111 
-119 121-123 128 132 
-146 149-150 153 
-170 173 179 186 188 
207 209-211 213 218 
236-237 246-247 
-263 265 268-275 
-282 284 286 289 293 
309 311 314 329 332 
349-350 364-365 368 
388 398 400-402 405 



macrophage 



Invitrogen 



HMP001 



186 259 334 414 



infant brain 



Soares 



IB2002 



3 7 10-11 14-15 17 19-20 22-23 
31 37 43-45 51 54-55 60-61 64- 
65 69 73-74 77 80 85-86 89-90 
93-94 102 105 109 114 123 128 
131-132 137 144-145 147 150 
152-153 155-159 164-166 169-170 
173-174 178 180-181 183-184 186 
188-191 193-194 198 210-211 218 
224 227-228 230-233 236-237 239 
242 246-247 251-259 261 263-264 
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267-269 271 

285-286 288 

303-304 306 

342 346 348 

377 383 385 
418-419 



273 275 280-283 

293 295-296 300 

314 316-317 329-330 

-351 361 364 373-375 

388 393 398-401 407 



infant brain 



Soares 



IB2003 



3-4 7 10-12 
51 69 74 80 
137 144-145 
189-190 198 
246 251-254 
271-272 283 
377 388-389 



31 37 39 43 45 48 
91 111 123 131 133 
159 169-171 173 178 
212 232 236 238-239 
257-258 263 268-269 
299 303 332 373-375 
399 404 407 



infant brain 



Soares 



IBM0 02 



25 43 70 89 
286 393 



137 186 242 257-258 



infant brain 



Soares 



IBS001 



3 15 37 137 
317 385 393 



191 231 238 271 299 



lung 

fibroblast 



Strategene 



LFB0 01 



1 21 45 47 
130 141-143 
234 268-270 
344 398 405 



61 63 86 102 

157-158 181 

272 279 284 
414 



113 119 
188 231 
295 341 



lung tumor 



Invitrogen 



LGT0 02 



1 3-4 12 14 
44 47-51 54- 
68 75-76 88 
110-111 113- 
132-133 139 
173 177 185- 
205-207 209- 
232 237-238 
261 263-264 
284-285 287 
317 322 332 
352-353 361 
386-387 391- 
405-406 414 



20 29 31-32 
55 59-60 62 
91 101-102 
114 119-120 
144-145 148 
186 189-190 
214 220-221 
242 248 253 
268-273 276 
292 294-296 
341-342 344 
370 376-377 
393 398 400 



37 43- 
65 67- 
107-108 
123 127 
159 171 
192 197 
223 230 
256 259 
280 

301 314 
346 
383 
402 



lymphocytes 



ATCC 



LPC001 



21 26 37 47 
123 130 144 
186 191 207 
271 279 288 
329 332 350 
402 418 



-48 59 86 91 102 119 
149 155 159 178 181 
220 229 255-256 261 
295 300 311 319-320 
361 366 381 383 387 



leukocyte 



GIBCO 



LUC0 01 



2-3 7-8 10- 
43-44 46-50 
74-76 81 86 
109-110 117 
133 141-143 
165-166 169 
186 189-193 
209-214 218 
236-237 239 
257-259 261 
285-286 293 
314 316-318 
337-342 344 
366 368 370 
391-393 400 



14 17 20-21 24 37-38 
52 54 56 58-61 70 
91 97-99 101 106 

-120 123 125 128-131 
145-147 149 155-159 

-170 173 178-179 181 
200 202-203 207 
227-228 230-233 
242 246-248 253-254 

-275 277-279 281-282 

-295 300 307 310-311 
322 325 330 332 
349-350 352 361 364 
374-377 381 383 

-402 408 414 418 



787CIP2B 



116 



TABLE 1 



leukocyte 


Clontech 


LUC0 03 


21 43 60-61 92 117-119 179 181 








186 200 218 230-231 264 270 274 








277-278 294 308 350 414 


melanoma from 


Clontech 


MEL0 04 


1 10-11 16 37 44 51 59 67-68 78 


cell line ATCC 






86 103 113-114 130 145 149 152 


# CRL 1424 






156 173 181 188 209-211 222 








231-232 237 239 250 263-264 








268-272 274 295 310 316-317 325 








368 377 401 405 414 


mammary gland 


Invitrogen 


MMG0 01 


3 9 12 16 20 23 29 31-32 34 37 








43-44 47 50-52 56 58 60-61 67- 








69 75-76 79 84 86 91 98-102 108 








110 114-117 119 123 126-127 129 








132 134 137 147 149 152-153 156 








164 168 171 173 178-181 187-190 








193 196 204 209 212 214 218 








223-224 229-230 232 234 237-239 








243 248 252-261 263-264 266-269 








271-273 275 285 287-288 290 








294-296 298-299 303 310 316-317 








327 329 332 341-342 344 346 349 








354-356 366 368 370 381 383 385 








391-392 394-396 398 402 404 406 








416 


induced neuron 


Strategene 


NTD0 01 


10-11 20-21 24 37 59 63 67-68 


cells 






89 120 133 144 165 173 178-179 








242 261 279 294 303 398 401 405 








414 


retinoid acid 


Strategene 


NTR0 01 


34 89 123 277-278 295 401 


induced 








neuronal cells 








neuronal cells 


Strategene 


NTU0 01 


10-11 16 28 43 48 50 59 69 75- 








76 123 128 156 179 229 232-233 








261 268-269 287 295 311 385 393 








418 


pituitary 


Clontech 


PIT004 


3 7 22-23 26 75-76 111 141-143 


gland 






286 


placenta 


Clontech 


PLA0 03 


224 


prostate 


Clontech 


PRT0 01 


62 64 91 98-99 102 129 133 139 








148-149 152 154-155 172 186 192 








213 233 253-254 262 270 274 295 








303 350 364 382 414 418 


rectum 


Invitrogen 


REC0 01 


3 33 43 50 69 74 91 95 108 132 








134 169-170 181 194 208 230-231 








234 242 253-258 261 263 266 270 








285 288 298 308 317 336-337 394 


salivary gland 


Clontech 


SAL0 01 


12 26 44 47 49 51 54-55 59 79 








91 108 111 123 132 134 212-213 








229 237-238 250 253-254 295 311 








352 381 383 390 394 401 


saliva gland 


Clontech 


SALS 03 


37 261 


skin 


ATCC 


SFB001 


166 


fibroblast 








skin 


ATCC 


SFB002 


341 


fibroblast 








small 


Clontech 


SIN001 


12 22 24 31 37-38 41-43 46 4.9 
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1 



intestine 






86 115-116 120 133 149 166 169- 








170 174 196 209 230-231 238 250 








253-254 263 275 295 364 384 








386-387 410 414 


skeletal 


Clontech 


SKM001 


59 75-76 123 154 238 271 274 


muscle 






311 345 366 390 


spinal cord 


Clontech 


SPC001 


23 26 29 32 37 43 47 55 58-59 








80 98-99 102 104 115-116 119- 








120 133-135 137 144 146 149 152 








157-158 173 186 188 207 231 236 








255-256 262 270 274 281-282 284 








309 316 364 368 382 389 393-394 








414 


adult spleen 


Clontech 


SPLcOl 


14 52 80 110 207 261 368 


stomach 


Clontech 


STO001 


31 47 74 80 102 110 132 145 149 








176 179 230 294 310 316 322 401 








418 


Mixture of 16 


Various 


SUP002 


33 45 48 110 370 388 398 400- 


tissues- 


Vendors* 




401 414 


mRNAs * 








Mixture of 16 


Various 


SUP005 


16 347 383 


tissues - 


Vendors* 






mRNAs * 








Mixture of 16 


Various 


SUP008 


163 402 


tissues- 


Vendors* 






mRNAs * 








Mixture of 16 


Various 


SUP009 


43 370 


tissues- 


Vendors* 






mRNAs* 








thalamus 


Clontech 


THA002 


23 33 49 51 69 73 84-86 95 115- 








116 134 152 155 176 191 198 221 








263 266 273 275 284 295-296 311 








329 389 


thymus 


Clontech 


THM0 01 


3 7 14 16 21 27 37-38 44 47 60- 








61 69 72 84 101 149 157-158 166 








173 181 186 200 207 209 218 








227-228 232 236 249 259 262 273 








295 337 368 374-376 381 393 419 


thymus 


Clontech 


THMC02 


3-4 7-8 12-13 16-17 21-22 31 








36-38 44 49 51 60 63 75-76 86 








89 91 98-99 103 106 111 113 118 








123 128 130 134 139-140 155 








157-158 160 166 171 175 177 181 








191 200 203 210-211 213 215-219 








230-232 235 239 248 250-251 








253-259 262 264-265 270-272 276 



* The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 10) 
human leukemia lymphablastic mRNA (Clontech), 11) human thymus mRNA (Clontech), 12) human lymph node 
mRNA (Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15) human 
esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain). 
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279 


281- 


282 


292 


296 


298 


308 


322 








325 


330 


337 


339 


-346 


350 


366 


370 








373 


381 


393 


399 


402 


413 


-415 




thvroic] crl and 


Clontech 


THRO 0 1 


3 7 


22 26 30-31 


36 43 47 50 


-51 








54- 


55 58 


-61 


65 


71 74 


-76 


78- 


80 








89 


98-102 111 115-117 123 127- 








128 


130- 


133 


139 


141- 


145 


149 


-150 








154 


-155 


166 


-167 


171- 


174 


181 


186 








189 


-190 


194 


200 


207 


209 


212 


-213 








222 


230 


233 


237 


-239 


243 


248 


250 








257 


-258 


261 


268 


-272 


276 


-279 


284 








286 


293- 


295 


311 


-312 


317 


-322 


327 








335 


339 


341 


-344 


349 


352 


368 


377 








383 


393 


395 


-396 


400 


403 


405 


412 








414 


418- 


419 












trachea 


Clontech 


TRC0 01 


1 62 75- 


76 


115- 


116 149 


151 


153 








230 


236 


261 


2 64 


284 


294 


341 


344 








364 


377 


391 


-392 










uterus 


Clontech 


UTR001 


33 


43 48 


-49 


52 


55 60 


74 


80 


83 








86 


89 111 123 134 139 157-158 








175 


181 


229 


268 


-270 


287 


294 


-295 








325 


414 
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SEQ 
ID NO 


CORRESPONDING 
SEQ ID NO. IN 
U.S. S.N. 
09/560,875 


ACCESSION 
NUMBER 


DESCRIPTION 


SMITH- 
WATERMAN 
SCORE 


% 

IDENTITY 


1 


44 


AL137294 


Homo sapiens 
hypothetical protein 


469 


95 


2 


50 


AJ271684 


Homo sapiens myeloid 
DAP12 -associating 
lectin 


1013 


100 


3 


93 


AL133589 


Homo sapiens 
hypothetical protein 


2707 


99 


4 


224 


U51730 


Murine leukemia virus 
reverse transcriptase 


316 


43 


5 


318 


AB033027 


Homo sapiens KIAA12 01 
protein 


3847 


100 


6 


318 


AB033027 


Homo sapiens KIAA12 01 
protein 


3809 


99 


7 


795 


AF161432 


Homo sapiens HSPC314 


1059 


93 


8 


857 


AB029488 


Homo sapiens Cllorf21 


758 


99 


9 


924 


AJ251024 


Homo sapiens putative 
odorant binding 
protein ag 


1239 


100 


10 


944 


AK0012 84 


Homo sapiens unnamed 
protein product 


2410 


97 


11 


944 


AK001284 


Homo sapiens unnamed 
protein product 


2410 


97 


12 


967 


AE003799 


Drosophila 
melanogaster CG53 23 
gene product 


288 


40 


13 


1055 


AF197927 


Homo sapiens AF5q31 
protein 


3992 


99 


14 


1091 


D28500 


Homo sapiens 
mitochondrial 
isoleucine tRNA 
synthetase 


4286 


98 


15 


1225 


X97868 


Homo sapiens 
aryl sulphatase 


3141 


98 


16 


1257 


AL162048 


Homo sapiens 
hypothetical protein 


1532 


100 


17 


1289 


AL137657 


Homo sapiens 
hypothetical protein 


586 


100 


18 


1292 


AL137662 


Homo sapiens 
hypothetical protein 


1339 


99 


19 


1455 


M15888 


Bos taurus endozepine- 
related protein 
precursor 


2425 


85 


20 


1488 


X66363 


Homo sapiens 
serine/ threonine 
protein kinase 


2562 


100 


21 


1666 


AE003606 


Drosophila 
melanogaster CG1078 
gene product 


412 


30 


22 


1811 


AF100772 


Homo sapiens tenascin- 
Ml 


11535 


99 


23 


1885 


AF090934 


Homo sapiens PRO0518 


382 


100 


24 


1911 


AB021643 


Homo sapiens 


2761 


99 
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gonadotropin inducible 

transcription 

repressor-3 






25 


1935 


Z83123 


Caenorhabditis elegans 
T04A11 .2 


315 


50 


26 


1971 


AF070666 


Homo sapiens Kruppel- 
associated box protein 


466 


97 


27 


1989 


AK000137 


Homo sapiens unnamed 
protein product 


925 


99 


28 


2041 


A58331 


Homo sapiens unnamed 
protein product 


1126 


61 


29 


2178 


AF227906 


Homo sapiens UDP- 
glucose : glycoprotein 
glucosyltransf erase 2 
precursor 


7820 


99 


30 


2237 


AF118566 


Mus mus cuius 
hematopoietic zinc 
finger protein 


1769 


92 


31 


2279 


AK000619 


Homo sapiens unnamed 
protein product 


810 


100 


32 


2338 


AF227906 


Homo sapiens UDP- 
glucose : glycoprotein 
glucosyltransf erase 2 
precursor 


7820 


99 


33 


2351 


AF117946 


Homo sapiens Link 
guanine nucleotide 
exchange factor II 


2363 


100 


34 


2405 


AB032997 


Homo sapiens KIAA1171 
protein 


2946 


99 


35 


2531 


AB032971 


Homo sapiens KIAA114 5 
protein 


2290 


100 


36 


2584 


AE003453 


Drosophila 

melanogaster CG15670 
gene product 


352 


37 


37 


2608 


AF177388 


Homo sapiens cancer- 
amplified 
transcriptional 
coactivator ASC-2 


10748 


99 


38 


2655 


AJ002744 


Homo sapiens UDP- 
GalNAc polypeptide N- 
acetylgalactosaminyltr 
ansferase 7 


3469 


96 


39 


2656 


AF097366 


Homo sapiens cone 
sodium- calcium 
potassium exchanger 


3166 


100 


40 


2866 


AB032958 


Homo sapiens KIAA1132 
protein 


9116 


100 


41 


3015 


AB018299 


Homo sapiens KIAA075 6 
protein 


4445 


99 


42 


3015 


AB018299 


Homo sapiens KIAA075 6 
protein 


4408 


98 


43 


3043 


X13916 


Homo sapiens LDL- 
receptor related 
precursor (AA -19 to 
4525) 


4292 


99 


44 


3986 


AF151083 


Homo sapiens HSPC24 9 


444 


98 
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45 


4647 


AB017026 


Mus musculus 
oxysterol -binding 
protein 


2173 


98 


46 


4659 


D87077 


Homo sapiens KIAA0240 


5024 


100 


47 


5032 


AK001862 


Homo sapiens unnamed 
protein product 


954 


100 


48 


5244 


AB033010 


Homo sapiens KIAA1184 
protein 


2024 


100 


49 


5268 


D86971 


Homo sapiens no 
similarities to 
reported gene products 


840 


45 


50 


5281 


AE001691 


Thermotoga maritima 
conserved hypothetical 
protein 


307 


40 


51 


5282 


AF174602 


Homo sapiens F-box 
protein Fbx2 2 


391 


93 


52 


6147 


Z50853 


Homo sapiens CLPP 


615 


100 


53 


6178 


M2-0153 


Rattus norvegicus 
cytochrome c oxidase 
subunit Vic 


81 


55 


54 


6184 


X03475 


Rattus norvegicus 
ribosomal protein L3 5a 
(aa 1-110) 


576 


99 


55 


6187 


AB029025 


Homo sapiens KIAA1102 
protein 


4394 


100 


56 


6190 


U2 02 3 9 


Mus musculus fibrosin 


288 


76 


57 


6191 


AF125101 


Homo sapiens HSPC04 0 
protein 


70 


100 


58 


6194 


AB018333 


Homo sapiens KIAA07 90 
protein 


6546 


100 


59 


6196 


X54326 


Homo sapiens 
glutaminyl -tRNA 
synthetase 


7577 


99 


60 


6201 


X61585 


Bos taurus 
polynucleotide 
adenylyltransf erase 


3715 


97 


61 


6208 


AB040921 


Homo sapiens KIAA14 88 
protein 


4418 


99 


62 


6214 


AC006370 


Homo sapiens testis - 
specific basic protein 
Y 1 


87 


50 


63 


6217 


AE003585 


Drosophila 

melanogaster CG17652 
gene product 


400 


40 


64 


6220 


AJ245922 


Homo sapiens alpha- 
tubulin 8 


2370 


100 


65 


6221 


AB029001 


Homo sapiens KIAA107 8 
protein 


4418 


100 


66 


6222 


Z97653 


Homo sapiens 

C3 80A1.2.1 (novel 

protein (isoform 1) ) 


1567 


100 


67 


6223 


AJ404326 


Homo sapiens SR+89 


1871 


99 


68 


6223 


AJ404326 


Homo sapiens SR+89 


902 


64 


69 


6226 


AF134726 


Homo sapiens G9A 


5334 


99 


70 


6227 


AE003538 


Drosophila 

melanogaster CG10191 


358 


44 
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gene product 






71 


6229 


M15530 


Homo sapiens B-cell 
growth factor 


89 


40 


72 


6248 


S67970 


Homo sapiens 
ZNF75=KRAB zinc finger 


951 


76 


73 


6260 


L28035 


Mus musculus protein 
kinase C gamma 


3768 


98 


74 


6264 


U16359 


Rattus norvegicus 
nitric oxide synthase 


91 


58 


75 


6269 


AK000566 


Homo sapiens unnamed 
protein product 


1053 


100 


76 


6269 


AK000566 


Homo sapiens unnamed 
protein product 


537 


100 


77 


6275 


X61118 


Homo sapiens TTG- 
2a/RBTN-2a 


876 


100 


78 


6276 


Z96932 


Homo sapiens nuclear 
autoantigen fo 14 kDa 


496 


83 


79 


6280 


AJ277291 


Homo sapiens HELG 
protein 


678 


72 


80 


6287 


X82157 


Homo sapiens hevin 


3525 


99 


81 


6290 


AC007192 


Homo sapiens 

P 8 5 B_HUMAN / PTDINS-3 - 

KINASE P85 -BETA 


3825 


99 


82 


6293 


AL021918 


Homo sapiens b34I8.1 
(Kruppel related Zinc 
Finger protein 184) 


1713 


51 


83 


6305 


AF084464 


Rattus norvegicus GTP- 
binding protein REM2 


141 


30 


84 


6308 


AL049795 


Homo sapiens dJ622L5.2 
(novel protein) 


1756 


98 


85 


6309 


D00189 


Rattus norvegicus 
Na+, K+-ATPase alpha - 
subunit 


4269 


99 


86 


6312 


U33460 


Homo s ap i e n s DNA - 
directed RNA 
polymerase I, largest 
subunit 


8777 


98 


87 


6314 


D87076 


Homo sapiens similar 
to human bromodomain 
protein BR140 (JC2069) 


3067 


100 


88 


6316 


L43912 


Macaca mulatta 
mannose -binding 
protein A 


589 


93 


89 


6336 


AB018341 


Homo sapiens KIAA0798 
protein 


1224 


52 


90 


6341 


U70976 


Homo sapiens arrestin 


2068 


99 


91 


6343 


U80931 


Caenorhabditis elegans 
strong similarity to 
class-Ill of 
pyridoxal -phoshate- 
dependent 
aminotransferases 


1049 


46 


92 


6346 


AE003435 


Drosophila 
melanogaster CG3 016 
gene product 


240 


36 


93 


6357 


AB037860 


Homo sapiens KIAA1439 


2670 


99 
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protein 






94 


6359 


AL354615 


Homo sapiens 
hypothetical protein 


2418 


100 


95 


6367 


AE003789 


Drosophila 
melanogaster CG15234 
gene product 


66 


40 


96 


6383 


AC006729 


Caenorhabditis elegans 
Hypothetical protein 
Y24D9A.e 


120 


31 


97 


6385 


AF070530 


Homo s ap i ens unknown 


2227 


99 


98 


6396 


AJ133352 


Homo sapiens ZNF2 3 7 
protein 


2006 


100 


99 


6396 


AJ133352 


Homo sapiens ZNF2 37 
protein 


1025 


96 


100 


6403 


AF170708 


Homo sapiens T-box 
protein TBX3 


3700 


99 


101 


6405 


AK002080 


Homo s api ens unnamed 
protein product 


1546 


99 


102 


6414 


L32977 


Homo sapiens Rieske 
Fe-S protein 


1239 


93 


103 


6418 


AL080125 


Homo sapiens 
hypothetical protein 


1652 


56 


104 


6422 


AJ271671 


Homo s ap i e ns I RT 1 
protein 


432 


36 


105 


6425 


AB006631 


Homo sapiens The human 
homolog of mouse Cux-2 


6559 


100 


106 


6436 


AF067165 


Homo sapiens zinc 
finger protein 3 


977 


64 


107 


6471 


AF038169 


Homo sapiens unknown 


154 


38 


108 


6474 


X57817 


Homo sapiens 
immunoglobulin lambda 
light chain 


947 


79 


109 


6482 


U97002 


Caenorhabditis elegans 
similar to acyl-CoA 
dehydrogenases and 
epoxide hydrolases; 
Pfam domain PF00441 

(Acyl-CoA_dh) , 
Score=57.4, E- 
value=l . 7e-16 , N=2 ; 
contains similarity to 
Pfam domain PF0 0702 

(Hydrolase) , 
Score=57.4, E- 
value=le-13, N=l 


583 


37 


110 


6504 


AL031282 


Homo sapiens 
d J2 8 3 E3 . 6 . 1 ( PUTATIVE 
novel protein similar 
to many 

(archae) bacterial , 
worm and yeast 
hypothetical proteins) 


1041 


99 


111 


6510 


M93134 


Friend murine leukemia 
virus pol protein 


289 


45 


112 


6515 


AF055473 


Homo sapiens GAGE -8 


232 


47 


113 


6529 


X07383 


Sus scrofa 


60 


38 
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preproendothelin (AA - 
19 to 184) 






114 


6535 


S79410 


Mus musculus nuclear 
localization signal 
binding protein 


99 


42 


115 


6536 


AB020710 


Homo sapiens KIAA0903 
protein 


4919 


100 


116 


6536 


AB020710 


Homo sapiens KIAA0903 
protein 


3334 


75 


117 


6541 


AE003492 


Drosophila 
melanogaster CG1998 
gene product 


799 


47 


118 


6542 


D38552 


Homo sapiens The 
ha!539 protein is 
related to 
cyclophilin . 


2995 


100 


119 


6547 


AF272981 


Homo sapiens cytosolic 
aminopeptidase P 


3305 


100 


120 


6548 


AJ224747 


Homo s api ens C - 
terminal variant of 
hINADL including 2 
amino acid exchanges 
and an insertion of 2 8 
amino acids in frame. 


7969 


100 


121 


6552 


AL080063 


Homo sapiens 
hypothetical protein 


1354 


100 


122 


6552 


AL080063 


Homo sapiens 
hypothetical protein 


998 


98 


123 


6555 


X63526 


Homo sapiens homologue 
to elongation factor 
1-gamma from A.salina 


2273 


99 


124 


6560 


X15940 


Homo sapiens ribos oma 1 
protein L31 (AA 1-125) 


644 


100 


125 


6566 


M60832 


Homo sapiens alpha -2 
type VIII collagen 


3581 


99 


126 


6576 


AF039697 


Homo sapiens antigen 
NY-CO-31 


1213 


97 


127 


6584 


AF156929 


Sus scrofa 

inflammatory response 
protein 6 


1588 


83 


128 


6588 


AF264717 


Homo sapiens FYVE 
domain- containing dual 
specificity protein 
phosphatase FYVE-DSP2 


5610 


99 


129 


6589 


AF044578 


Homo sapiens putative 
DNA polymerase; P0L4P 


2478 


94 


130 


6590 


X89750 


Homo sapiens TGIF 
protein 


1413 


100 


131 


6597 


M93107 


Homo sapiens (R) -3- 
hydroxybu ty r a t e 
dehydrogenase 


1663 


96 


132 


6600 


AB018314 


Homo sapiens KIAA0771 
protein 


4930 


99 


133 


6602 


AF151538 


Homo sapiens 
deoxycytidyl 
transferase; Revip 


4281 


99 
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134 


6604 


AB020716 


Homo sapiens KIAA090 9 
protein 


6344 


99 


135 


6605 


AJ243874 


Homo sapiens 
oligophrenin-4 


3682 


100 


136 


6608 


Z11737 


Homo sapiens flavin- 
containing 
monooxygenase 4 


2969 


100 


137 


6610 


AB002318 


Homo sapiens KIAA032 0 


4639 


99 


138 


6614 


AJ245600 


Homo sapiens 
hypothetical protein 


2616 


99 


139 


6623 


AK001447 


Homo sapiens unnamed 
protein product 


1024 


100 


140 


6629 


AB006624 


Homo sapiens KIAA0286 


2246 


99 


141 


6631 


X90530 


Homo sapiens ragB 


1926 


99 


142 


6631 


X90530 


Homo sapiens ragB 


1405 


99 


143 


6631 


X90530 


Homo sapiens ragB 


1893 


96 


144 


6632 


AL022394 


Homo sapiens 
dJ511B24 . 3 (KIAA0395 
(probable Zinc Finger 
Homeobox protein) ) 


4990 


99 


145 


6633 


Y11395 


Homo sapiens 

s event ran s membrane - 

domain protein 


2168 


100 


146 


6634 


AJ010119 


Homo sapiens Ribosomal 
protein kinase B (RSK- 
B) 


4001 


100 


147 


6635 


AL080157 


Homo sapiens 
hypothetical protein 


2278 


100 


148 


6639 


X54131 


Homo sapiens protein- 
tyrosine phosphatase 


10465 


99 


149 


6649 


AK001972 


Homo sapiens unnamed 
protein product 


1665 


100 


150 


6651 


AC004142 


Homo sapiens similar 
to murine leucine-rich 
repeat protein; 
possible role in 
neural development by 
protein-protein 
interactions; 93% 
similarity to D49802 
(PID:gl369906) 


3676 


100 


151 


6655 


AL117544 


Homo sapiens 
hypothetical protein 


1226 


100 


152 


6658 


AF203032 


Homo sapiens 
neurofilament protein 


5115 


99 


153 


6667 


AL034417 


Homo sapiens 
bK215D11.2 (similar to 
rat gene 33) 


2476 


100 


154 


6672 


X69090 


Homo sapiens 190kD 
protein 


7546 


99 


155 


6682 


AB032957 


Homo sapiens KIAA1131 
protein 


5395 


100 


156 


6683 


AK000486 


Homo sapiens unnamed 
protein product 


801 


100 


157 


6687 


U39045 


Rattus norvegicus 
cytoplasmic dyne in 


3241 


97 
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intermediate chain 2B 






158 


6687 


AF063231 


Mus musculus 
cytoplasmic dynein 
intermediate chain 2 


3159 


97 


159 


6688 


AF202893 


Mus musculus Kif21b 


4336 


95 


160 


6696 


Y13115 


Homo sapiens 
serine/ threonine 
protein kinase 


5071 


99 


161 


6701 


AB030207 


Homo sapiens G gamma 
subunit 


364 


100 


162 


6707 


AL133030 


Homo sapiens 
hypothetical protein 


6174 


99 


163 


6712 


AB037883 


Homo sapiens Gb3/CD77 
synthase 


1916 


99 


164 


6714 


D90868 


Escherichia coli 
similar to 


1489 


100 


165 


6720 


X98834 


Homo sapiens zinc 
finger protein Hsal2 


5290 


100 


166 


6721 


D80007 


Homo sapiens similar 
to hypothetical 
protein YM9959.11C of 
S . cerevisiae . 


9568 


100 


167 


6722 


AF019926 


Mus musculus protein 
kinase 


1694 


90 


168 


6736 


M34513 


Homo sapiens omega 
protein 


317 


91 


169 


6740 


Y08612 


Homo sapiens 8 8kDa 
nuclear pore complex 
protein 


2313 


99 


170 


6740 


Y08612 


Homo sapiens 88kDa 
nuclear pore complex 
protein 


1561 


99 


171 


6760 


AB018310 


Homo sapiens KIAA0767 
protein 


2968 


100 


172 


6775 


AF186249 


Homo sapiens six 
transmembrane 
epithelial antigen of 
prostate 


1790 


100 


173 


6784 


AB029012 


Homo sapiens KIAA10 8 9 
protein 


4933 


100 


174 


6793 


AB026893 


Homo sapiens vascular 
cadherin-2 


5962 


100 


175 


6795 


X74331 


Homo sapiens DMA 
primase (p5 8 subunit) 


1720 


100 


176 


6796 


AC002391 


Arabidopsis thaliana 
hypothetical protein 


72 


30 


177 


6807 


AC007228 


Homo sapiens R31665_2 


1488 


48 


178 


6808 


X14830 


Homo sapiens 
acetylcholine receptor 
beta- subunit 
preprotein 


2639 


100 


179 


6810 


U80446 


Caenorhabditis elegans 
coded for by C. 
elegans cDNA 
ykl72e6.3; coded for 
by C. elegans cDNA 


420 


40 
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ykl5 8f7.3; coded for 
by C. elegans cDNA 
ykl5 8f7,5; coded for 
by C. elegans cDNA 
ykl72e6. 5 






180 


6815 


AB002360 


Homo sapiens KIAA03 62 


5766 


99 


181 


6819 


Z82215 


Homo sapiens dJ6802.2 
(myosin, heavy 
polypeptide 9, non- 
muscle) 


9828 


100 


182 


6821 


AC004010 


Homo sapiens similar 
to Leucine-rich 
transmembrane 
proteins; 44% 
similarity to U42767 
(PID:gl736918) 


2723 


100 


183 


6827 


AJ269537 


Homo sapiens 
chondroitin-4 - 
sulf otransf erase 


557 


40 


184 


6829 


X99688 


Homo sapiens TYL 


3326 


99 


185 


6830 


U90305 


Homo sapiens iroquois- 
class homeodomain 
protein IRX-3 


1022 


99 


186 


6835 


AF180473 


Homo sapiens Wot2p 


2267 


100 


187 


6848 


AB029009 


Homo sapiens KIAA1086 
protein 


4798 


99 


188 


6849 


AF023453 


Homo sapiens actin- 
related protein 3 -beta 


2219 


100 


189 


6851 


AB020699 


Homo sapiens KIAA08 92 
protein 


3222 


100 


190 


6851 


AC003030 


Homo sapiens R29828_l 


2294 


100 


191 


6863 


AB020630 


Homo sapiens KIAA0823 
protein 


2159 


100 


192 


6869 


AK001387 


Homo sapiens unnamed 
protein product 


1255 


99 


193 


6874 


D63484 


Homo sapiens The 
KIAA015 0 gene product 
is novel. 


4940 


99 


194 


6887 


Y07595 


Homo sapiens 
transcription factor 
TFIIH 


2373 


100 


195 


6890 


AF143321 


Homo sapiens unknown 


918 


95 


196 


6894 


X83618 


Homo sapiens 
hydroxymethylglutaryl - 
CoA synthase 


2647 


100 


197 


6899 


AF134726 


Homo sapiens NG3 7 


4359 


99 


198 


6900 


AB035356 


Homo sapiens neurexin 
I-alpha protein 


6948 


99 


199 


6903 


Z68005 


Caenorhabditis elegans 
F59F3.4 


67 


37 


200 


6910 


AB018323 


Homo sapiens KIAA07 80 
protein 


5220 


99 


201 


6913 


AB043634 


Homo sapiens PAR-6A 


885 


100 


202 


6918 


AP000693 


Homo sapiens partial 
CDS 


4875 


99 


203 


6923 


AF002223 


Homo sapiens 


3490 


100 
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myotubularin related 1 






204 


6926 


AC004893 


Homo sapiens similar 
to NEDD-4 (KIA0093); 
similar to P46934 
(PID:gll71682) 


1611 


100 


205 


6929 


AB002355 


Homo sapiens KIAA035 7 


10119 


100 


206 


6929 


AF257737 


Homo sapiens ciliary 
dyne in heavy chain 9 


11126 


99 


207 


6932 


X65873 


Homo sapiens kinesin 
heavy chain 


4860 


100 


208 


6941 


AL080159 


Homo sapiens 
hypothetical protein 


1942 


100 


209 


6951 


AC004883 


Homo sapiens similar 
to KIAA0766; similar 
to PID:g3882253 


2782 


99 


210 


6954 


X71125 


Homo sapiens 
glutaminyl -peptide 
eye lot ransf erase 


1914 


100 


211 


6954 


X71125 


Homo sapiens 
glutaminyl -peptide 
eye lot ransf erase 


1456 


97 


212 


6956 


X54304 


Homo sapiens myosin 
regulatory light chain 


897 


100 


213 


6957 


AB007883 


Homo sapiens KIAA042 3 


8621 


99 


214 


6960 


AK000432 


Homo s ap i ens unnamed 
protein product 


1622 


100 


215 


6966 


AF039563 


Mus musculus 
retinoblastoma binding 
protein 


792 


94 


216 


6968 


AL117352 


Homo sapiens 
dJ876B10.2 (novel 
protein (ortholog of 
rat EX084) ) 


3713 


99 


217 


6969 


AF228603 


Homo sapiens 
pleckstrin 2 


1841 


100 


218 


6970 


AL117455 


Homo sapiens 
hypothetical protein 


4598 


100 


219 


6971 


AK002077 


Homo s ap i ens unnamed 
protein product 


989 


100 


220 


6989 


AF156271 


Homo sapiens RING 
finger protein terf 


476 


40 


221 


6990 


AC005551 


Homo sapiens R26529_2, 
partial CDS 


1020 


100 


222 


6994 


AL080129 


Homo sapiens 
hypothetical protein 


3789 


99 


223 


6996 


AE003603 


Drosophila 
melanogaster CG1172 
gene product 


512 


54 


224 


6997 


AB002379 


Homo sapiens KIAA0381 


4437 


99 


225 


7009 


U03399 


Homo sapiens T- complex 
protein 10A 


846 


77 


226 


7016 


AB014091 


Xenopus laevis alpha - 
1 - ant iproteinase 


850 


42 


227 


7023 


AB029040 


Homo sapiens KIAA1117 
protein 


7032 


99 


228 


7023 


AL121716 


Homo sapiens 


6329 


99 
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dJ202D23.2 (novel 
protein) 






229 


7035 


X92715 


Homo sapiens KRAB 
/C2H2 zinc finger 
protein 


3102 


97 


230 


7038 


X54637 


Homo sapiens protein 
tyrosine kinase 


5564 


98 


231 


7039 


AB007925 


Homo sapiens KIAA0456 
protein 


5491 


99 


232 


7040 


AJ251245 


Rattus norvegicus 
SECIS binding protein 
2 


3086 


72 


233 


7041 


AF113125 


Homo sapiens E-l 
enzyme 


581 


100 


234 


7044 


M19529 


Sus scrofa follistatin 
A 


1906 


98 


235 


7059 


AB040902 


Homo sapiens KIAA1469 
protein 


3403 


100 


236 


7060 


D87685 


Homo sapiens similar 
to human transcription 
factor TFIIS (S34159) . 


8083 


99 


237 


7063 


AB029334 


Halocynthia roretzi 
HrPET-1 


638 


35 


238 


7067 


AK001603 


Homo sapiens unnamed 
protein product 


2079 


99 


239 


7070 


D86973 


Homo sapiens similar 
to Yeast translation 
activator GCN1 
(P1:A48126) 


12033 


99 


240 


7071 


AL034452 


Homo sapiens 
dJ682J15.1 (novel 
Collagen triple helix 
repeat containing 
protein) 


1979 


100 


241 


7079 


AF088219 


Homo sapiens lysozyme 
homo log 


307 


42 


242 


7085 


AF007872 


Homo sapiens torsinB 


1277 


98 


243 


7148 


U94586 


Homo sapiens 
NADH : ubiquinone 
oxidoreductase MLRQ 
subunit 


277 


66 


244 


7156 


U28486 


Mus musculus proline- 
rich acidic protein 


313 


48 


245 


7156 


U28486 


Mus musculus proline- 
rich acidic protein 


297 


47 


246 


7171 


AF022770 


Mus musculus 
peripherial 
benzodiazepine 
receptor associated 
protein; PBR 
associated protein; 
PAP 7 


2078 


91 


247 


7241 


AC004990 


Homo sapiens supported 
by Gens can and several 
ESTS: C83049 
(NID:g3062006) , 


2685 


99 
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AA823760 

(NID:g2893628) , 
AA215791 

(NID:gl815572) , 
AI095488 

(NID:g3434464) , and 
AA969095 
(NID:g3144275) 






248 


7265 


AB037756 


Homo sapiens KIAA13 3 5 
protein 


3590 


99 


249 


7268 


AF070447 


Homo sapiens barrier - 

to-autointegration 

factor 


290 


90 


250 


7308 


AB020719 


Homo sapiens KIAA0912 
protein 


6488 


100 


251 


7336 


X79828 


Mus mus cuius NK10 


202 


53 


252 


7347 


AB004109 


Cricetulus griseus 
phosphatidylserine 
synthase II 


2262 


92 


253 


7405 


AB028972 


Homo s ap i en s KIAA1 04 9 
protein 


2866 


99 


254 


7405 


AB028972 


Homo sapiens KIAA104 9 
protein 


2754 


97 


255 


7412 


AF006264 


Homo sapiens 
recombination and 
sister chromatid 
cohesion protein 
homo log 


2850 


100 


256 


7412 


AF006264 


Homo sapiens 
recombination and 
sister chromatid 
cohesion protein 
homo log 


2530 


100 


257 


7436 


X82260 


Homo sapiens RanGAPl 


2929 


100 


258 


7436 


X82260 


Homo sapiens RanGAPl 


1843 


97 


259 


7454 


AF160909 


Drosophila 
melanogaster 
BcDNA. LD034 71 


943 


59 


260 


7476 


AF225324 


Centrocercus 
urophasianus gag 
polyprotein 


94 


33 


261 


7598 


X74801 


Homo sapiens gamma 
subunit of CCT 
chaperonin 


2745 


99 


262 


7619 


AL031427 


Homo sapiens 
dJ167A19 . 1 (novel 
protein) 


1608 


100 


263 


7644 


AF258614 


Canis familiaris 
vacuolar proton-ATPase 
subunit ATP6H 


345 


72 


264 


7648 


X16396 


Homo sapiens precursor 
polypeptide (AA -2 9 to 
315) 


1749 


100 


265 


7659 


AK000281 


Homo sapiens unnamed 
protein product 


1814 


99 


266 


7661 


AB011128 


Homo sapiens KIAA0556 


5761 


99 I 
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protein 






267 


7669 


AL050284 


Homo sapiens 
hypothetical protein 


2857 


100 


268 


7686 


D26068 


Homo sapiens KIAA003 8 


1208 


100 


269 


7686 


AF045555 


Homo sapiens wbscrl 
alternative spliced 
product 


1318 


100 


270 


7694 


U22229 


Felis catus ribosomal 
protein L41 


128 


100 


271 


7697 


Y17169 


Homo sapiens A6 
related protein 


1819 


100 


272 


7733 


Y12065 


Homo sapiens hNop56 


2956 


99 


273 


7734 


AF177758 


Homo sapiens ubiquitin 
specific protease 16 


2998 


100 


274 


7744 


AB020681 


Homo sapiens KIAA0874 
protein 


3090 


100 


275 


7751 


AL050297 


Homo sapiens 
hypothetical protein 


1227 


99 


276 


7756 


AC007204 


Homo sapiens 
BC273239 1 


3398 


99 


111 


7761 


Z68747 


Homo sapiens imogen 3 8 


2024 


99 


278 


7761 


Z68747 


Homo sapiens imogen 3 8 


1958 


97 


279 


7776 


AK001467 


Homo sapiens unnamed 
protein product 

XT XT 


3416 


99 


280 


7783 


AF095687 


Homo sapiens brain and 

nasopharyngeal 

carcinoma 

susceptibility protein 
NSG-x 


409 


100 


281 


7800 


AF119664 


Homo sapiens 
transcriptional 
regulator protein 
HCNGP 


1574 


100 


282 


7800 


AF119664 


Homo sapiens 
transcript ional 
regulator protein 
HCNGP 


1150 


89 


283 


7801 


Y17849 


Homo sapiens 

gangl ios ide - induced 

differentiation 

associated protein 1 


1839 


98 


284 


7811 


X55740 


Homo sapiens 5 ' - 
nucleotidase 


3012 


100 


285 


7817 


AF039688 


Homo sapiens antigen 
NY-CO-3 


931 


100 


286 


7821 


AL050282 


Homo sapiens 
hypothetical protein 


2424 


100 


287 


7822 


AB007836 


Homo sapiens Hie -5 


2544 


100 


288 


7841 


AB002301 


Homo sapiens KIAA03 03 


11166 


100 


289 


7847 


U32305 


Caenorhabditis elegans 
B0336.ll gene product 


233 


26 


290 


7880 


AF078844 


Homo sapiens hqp0376 
protein 


416 


81 


291 


7910 


U2 83 77 


Escherichia coli 
ORFJE23 9; was ORF__fl91 
and ORF fl94 before 


1198 


100 
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1 









splice 






292 


7925 


AL122047 


Homo sapiens 
hypothetical protein 


404 


100 


293 


7936 


AL080123 


Homo sapiens 
hypothetical protein 


3321 


99 


294 


7945 


U58682 


Homo sapiens ribosomal 
protein S28 


340 


100 


295 


7948 


X57432 


Rattus rattus 
ribosomal protein S2 


1520 


98 


296 


7963 


AB002348 


Homo sapiens KIAA0350 
protein 


5186 


99 


297 


7984 


AC000098 


Arabidopsis thaliana 
YUP8H12 . 9 


37 


46 


298 


7985 


AF110645 


Homo sapiens candidate 
tumor suppressor p33 
ING1 homo log 


843 


69 


299 


8014 


AB011168 


Homo sapiens KIAA0 596 
protein 


6279 


99 


300 


8025 


AJ237839 


Homo sapiens 
hypothetical protein 


11699 


99 


301 


8029 


AP0O1633 


Oryza sativa ESTs 
AU083470 (E11568) ,C2013 
6(E11568) correspond 
to a region of the 
predicted 
gene . -Similar to 
Arabidopsis thaliana 
chromosome II BAC 
F16M14 genomic 
sequence ; unknown 
protein (AC003 02 8) 


424 


43 


302 


8043 


AL163201 


Homo sapiens tensin, 
putative protein- 
tyrosine phosphatase, 
EC 3 .1.3.48 


74 


35 


303 


8164 


AC002550 


Homo sapiens Unknown 
gene product 


858 


99 


304 


8175 


U26592 


Homo sapiens diabetes 
mellitus type I 
autoantigen 


253 


66 


305 


8250 


X60155 


Homo sapiens zinc 
finger 41 


4349 


100 


306 


8253 


X53330 


Platynereis dumerilii 
H4 protein (AA 1 - 
103) 


523 


100 


307 


8255 


AC003682 


Homo sapiens R2 7 94 5_2 


2558 


100 


308 


8258 


X80473 


Mus musculus rabl9 


596 


56 


309 


8270 


J02649 


Rattus norvegicus 
unknown protein 


201 


95 


310 


8271 


AC006014 


Homo sapiens similar 
to RFP transforming 
protein; similar to 
P14373 (PID:gl32517) 


1331 


99 


311 


8272 


X92972 


Homo sapiens protein 
phosphatase 6 


1666 


100 


312 


8279 


L35269 


Homo sapiens zinc 


2803 


99 
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finger protein 






313 


8284 


AC003682 


Homo sapiens F18547 1 


3184 


96 


314 


8285 


X79204 


Homo sapiens ataxin-1 


4195 


99 


315 


8304 


X17620 


Homo sapiens Nm23 
protein 


965 


99 


316 


8309 


AB029004 


Homo sapiens KIAA10 81 
protein 


2362 


100 


317 


8320 


Z56281 


Homo sapiens 
interferon regulatory 
factor 3 


2331 


100 


318 


8331 


Z73906 


Caenorhabditis elegans 
Similarity to 
B.subtilis YQJC 
protein 

(TR:G13Q3954) -CDMA EST 
EMBL:T01187 comes from 
this gene 


470 


70 


319 


8332 


270200 


Homo sapiens U5 snRNP- 
specific 200kD protein 


8819 


99 


320 


8332 


Z702O0 


Homo sapiens U5 snRNP- 
specific 200kD protein 


8589 


97 


321 


8335 


AF153450 


Manduca sexta juvenile 
hormone esterase 
binding protein 


225 


32 


322 


8337 


AF227198 


Homo sapiens CDC2- 
related protein kinase 
7 


7231 


99 


323 


8353 


X99586 


Homo sapiens SMT3C 
protein 


441 


87 


324 


8355 


Y18198 


Homo sapiens OMECUT-2 
transcription factor 
(0C-2) 


2592 


100 


325 


8358 


AK001550 


Homo sapiens unnamed 
protein product 


3654 


99 


326 


8361 


AF139471 


Homo sapiens putative 
calcium- activated 
potassium channel 
regulatory subunit 


1385 


100 


327 


8369 


AL110479 


Caenorhabditis elegans 
predicted using 
Gene finder-cDNA EST 
yk524f8.5 comes from 
this gene-cDNA EST 
yk631e2.5 comes from 
this gene-cDNA EST 
EMBL:C0 8 367 comes from 
this gene-cDNA EST 
yk524f8.3 comes from 
this gene 


709 


48 


328 


8385 


AF007826 


Homo sapiens bax 
epsilon 


133 


66 


329 


8397 


AE003544 


Drosophila 
melanogaster CG11712 
gene product 


514 


47 


330 


8414 


AE003536 


Drosophila 
melanogaster CG17365 


267 


35 
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gene product 






331 


8431 


U83115 


Homo sapiens non-lens 
beta gamma-crystallin 
like protein 


8569 


99 


332 


8433 


AF203687 


Homo sapiens prolactin 
regulatory element - 
binding protein 


2181 


100 


333 


8444 


M27685 


Mus musculus ultra- 
high sulphur keratin 


650 


59 


334 


8446 


U04968 


Cricetulus griseus 
nucleotide excision 
repair protein 


3712 


97 


335 


8460 


AF188181 


Homo sapiens G-protein 
gamma 12 subunit 


356 


100 


336 


8478 


AF117587 


Manduca sexta unknown 


255 


74 


337 


8490 


X67699 


Homo sapiens CDw52 
antigen 


297 


100 


338 


8505 


AF022789 


Homo sapiens ubiquitin 
hydro ly zing enzyme I 


1892 


100 


339 


8523 


AJ001006 


Mus musculus EMeg32 
protein 


938 


96 


340 


8530 


U31332 


Homo sapiens DP 
prostanoid receptor 


1467 


100 


341 


8533 


AF019661 


Mus musculus zeta 
proteasome chain; 
PSMA5 


1214 


100 


342 


8534 


AF156557 


Homo sapiens stomatin 
related protein 


2036 


100 


343 


8536 


AK000438 


Homo sapiens unnamed 
protein product 


593 


100 


344 


8537 


AF161512 


Homo sapiens HSPC163 


738 


100 


345 


8543 


AL031115 


Homo sapiens ZXDA, 
ZXDB (zinc finger X- 
linked protein) 


4298 


100 


346 


8546 


L40410 


Homo sapiens thyroid 
receptor interactor 


806 


100 


347 


8553 


AC004542 


Homo sapiens 
OXYSTEROL - B INDING 
PROTEIN- like; similar 
to P22059 
(PID:gl29308) 


2533 


99 


348 


8556 


AB002298 


Homo sapiens KIAA03 00 


8265 


99 


349 


8561 


AE003691 


Drosophila 
melanogaster CG5276 
gene product 


875 


53 


350 


8562 


AK000051 


Homo sapiens unnamed 
protein product 


1613 


100 


3 O I 


8569 


U10362 


Homo sapiens GP36b 
glycoprotein 


790 


57 


352 


8587 


AL133506 


Unknown 

/prediction^ (method : " " 
genscan" " , 
version : " " 1 . 0 " " , 
score: ""109.13" ") ; 
/prediction- (method: 


825 


48 


353 


8597 


X61001 | Gallus gallus lysozyme 


422 


42 
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"3 C A 


8610 


AC003034 


Homo sapiens Gene with 
similarity to rat 
kidney- specif ic (KS) 
gene 


1190 


100 


J 3 O 


oblU 


AC003034 


Homo sapiens Gene with 
similarity to rat 
kidney- specific (KS) 
gene 


937 


95 


"2 C £ 
J O D 


DDlD 


A J2 42 832 


Homo sapiens calpain 


3756 


100 


357 


8622 


S52624 


Homo sapiens orf 5 ' of 
PAF receptor 


185 


100 


358 


8626 


AF005081 


Homo sapiens skin- 
specific protein 


652 


100 


ICQ 


o o 


Y16793 


Homo sapiens keratin, 
type I 


2232 


100 


360 


8629 


AF005080 


Homo sapiens skin- 
specific protein 


438 


73 


361 


8630 


AK001429 


Homo sapiens unnamed 
protein product 


682 


99 


j o ^ 


Q £Z O O 


AFO05 08O 


Homo sapiens skin- 
specific protein 


375 


62 




odj4 


Y16132 


Homo sapiens CDT6 


1874 


100 


: 3 64 

} 


8643 


D64048 


Rattus norvegicus 
phosphatidylinositol 
3 -kinase p45 subunit 


2347 


97 


J D D 


8 644 


X63422 


Homo sapiens H(+)- 
transporting ATP 
synthase 


209 


100 


j 366 

1 


8645 


M73837 


Homo sapiens modulator 
recognition factor 2 


1284 


100 


367 


8646 


X52563 


Bos taurus permability 
increasing protein 


383 


29 


jj jDO 

? 


o t: r 1 


X82385 


Homo sapiens RNA 
polymerase II subunit 


429 


95 




Q /T /T 1 


A J2 45586 


Homo sapiens KRAB 
protein domain 


396 


98 


J / U 


8670 


AF223466 


Homo sapiens HT015 
protein 


1311 


97 


"2 "7 "1 
O / ± 


8692 


AF250878 


Salmonella typhi 
plasmid maintenance 
protein 


50 


40 


372 


8698 


AF117383 


Homo sapiens placental 
protein 13/ PP13 


746 


100 


7 *7 *Z 


8762 


AK000192 


Homo sapiens unnamed 
protein product 


1446 


99 


7 7 A 


o /bo 


AK0 01715 


Homo sapiens unnamed 
protein product 


715 


99 


375 


8768 


AL163815 


«.! dDiuupbifa cnajLiana. 
putative protein 


3 02 


67 


376 


8799 


AL031685 


Homo sapiens 
dJ963K2 3.2 (novel 
protein) 


343 


48 


377 


8806 


AB023191 


Homo sapiens KIAA0974 
protein 


2953 


100 


378 


8809 


AL031778 


Homo sapiens dJ34B21.1 
(novel BZRP 


92 0 


100 
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(benzodiazapine 
receptor (peripheral ) 
(MBR, PBR, PBKS, IBP , 
I soquinoline -binding 
protein) ) LIKE 
protein) 






379 


8814 


AL021939 


Homo sapiens 
dJ352A2 0 . 2 (aldehyde 
dehydrogenase family 
protein) 


1764 


100 


380 


8822 


AB037784 


Homo sapiens KIAA13 63 
protein 


1895 


98 


381 


8833 


Y08565 


Homo sapiens UDP- 
GalNAc polypeptide N- 
acetylgalactosaminyltr 
ansf erase 


3331 


99 


382 


8835 


AB037755 


Homo sapiens KIAA13 34 
protein 


1655 


10O 


383 


8877 


AB037811 


Homo sapiens KIAA13 90 
protein 


1563 


99 


384 

i 


8886 


AK00 07 67 


Homo sapiens unnamed 
protein product 


2470 


10O 


385 

\ 


J \J KJ j 


17 84 32 


Homo sapiens SH3 
protein 


3302 


100 


* 386 


9157 


U60269 


Homo sapiens putative 
envelope protein; orf 
similar to env of Type 
A and Type B 
retroviruses and to 
class II HERVs 


893 


95 


387 


9175 


AF015454 


Xenopus laevis ER1 


1716 


80 


388 


9205 


U09355 


Oryctolagus cuniculus 
protein phosphatase 
2A1 B gamma subunit 


2352 


99 


389 


9260 


X89480 


Sus scrofa 

transmembrane protein 


601 


80 


390 


9295 


AB027003 


Mus musculus protein 
phosphatase 


378 


84 


391 


9307 


AF112200 


Homo sapiens NADH- 
oxidoreductase B18 
subunit 


739 


100 


3 92 




AJt 1 1122 00 


Homo sapiens NADH- 
oxidoreductase B18 
subunit 


613 


88 


393 


9312 


AT 1 QO C ^ C 
AJ_I X J J O D 5 


Homo sapiens 
hypothetical protein 


6331 


100 


3 94 


934 7 


Ah U U 3 5 1 1 


Drosophila 
melanogaster CG14194 
gene product 


825 


48 


395 


9370 


L27479 


Homo, sapiens X123 


1237 


99 


396 


9370 


L27479 


Homo sapiens X123 


1206 


97 


397 


9382 


AF182066 


Mus musculus Bv8 
variant 3 precursor 


293 


51 


398 


9591 


AJ001019 


Homo sapiens ring 
finger protein 


1292 


99 


399 


9650 


X68453 


Sus scrofa tubulin- 


1882 


94 
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tyrosine ligase 






400 


9655 


AK001723 


Homo sapiens unnamed 
protein product 


624 


56 


401 


9663 


AF151069 


Homo sapiens HSPC235 


1694 


96 


402 


9715 


AF043695 


Caenorhabditis elegans 
Similar to 

mitochondrial carrier 
protein 


556 


43 


403 


9755 


AK000149 


Homo sapiens unnamed 
protein product 


2589 


99 


404 


9766 


AF130979 


Homo sapiens SH3 
domain-containing 
protein 6511 


2249 


99 


405 


9771 


AL008635 


Homo sapiens 
dJ510H16.2 (high- 
mobility group protein 
2 -like 1) 


3026 


99 


406 


9784 


AL137593 


Homo sapiens 
hypothetical protein 


1124 


100 


407 


9925 


AF111179 


Rattus norvegicus G- 
septin alpha 


1739 


98 


408 


9970 


AF004161 


Oryctolagus cuniculus 
peroxisomal Ca- 
dependent solute 
carrier 


2371 


95 


409 


9997 


Z19585 


Homo sapiens 
thrombospondin-4 


4239 


100 


410 


10008 


AB033053 


Homo sapiens KIAA122 7 
protein 


1857 


100 


411 


10010 


AC005534 


Homo sapiens supported 
by human ESTs AA4124 02 
(NID:g2 070990) NH44 021 
(NID:gll82549) , mouse 
EST AA065933 
(NID;gl562789) , and 
genscan 


756 


100 


412 


10023 


AF090326 


Mus musculus AE-1 
binding protein AEBP2 


1540 


97 


413 


10043 


U3 9412 


Homo sapiens alpha 
SNAP 


879 


81 


414 


10093 


AC003007 


Homo sapiens Unknown 
gene product (partial) 


649 


96 


415 


10172 


U66372 


Bos taurus ribosomal 
protein S29 


230 


73 


416 


10184 


AB032910 


Hylobates muelleri 
dopamine receptor D4 


83 


46 


417 


10205 


AF225903 


Homo sapiens Dl 
dopamine receptor 
interacting protein 
calcyon 


1152 


100 


418 


10246 


AE003486 


Drosophila 
melanogaster CG1749 
gene product 


1291 


65 


419 


10298 


X92666 


Bos taurus cysteine 
string protein 


1085 


98 



787CIP2B 



